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ABSTRACT 

Weak lensing is emerging as a powerful observational tool to constrain cosmological 
models, but is at present limited by an incomplete understanding of many sources of 
systematic error. Many of these errors are multiplicative and depend on the population 
of background galaxies. We show how the commonly cited geometric test, which is 
rather insensitive to cosmology, can be used as a ratio test of systematics in the 
lensing signal at the 1 per cent level. We apply this test to the galaxy-galaxy lensing 
analysis of the Sloan Digital Sky Survey (SDSS), which at present is the sample 
with the highest weak lensing signal to noise and has the additional advantage of 
spectroscopic redshifts for lenses. This allows one to perform meaningful geometric 
tests of systematics for different subsamples of galaxies at different mean redshifts, such 
as brighter galaxies, fainter galaxies and high-redshift luminous red galaxies, both with 
and without photometric redshift estimates. We use overlapping objects between SDSS 
and the DEEP2 and 2SLAQ spectroscopic surveys to establish accurate calibration of 
photometric redshifts and to determine the redshift distributions for SDSS. We use 
these redshift results to compute the projected surface density contrast AE around 
259 609 spectroscopic galaxies in the SDSS; by measuring AE with different source 
samples we establish consistency of the results at the 10 per cent level (Icr). We also use 
the ratio test to constrain shear calibration biases and other systematics in the SDSS 
survey data to determine the overall galaxy-galaxy weak lensing signal calibration 
uncertainty. We find no evidence of any inconsistency among many subsamples of the 
data. 
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1 INTRODUCTION 



Recent years have seen tremendous progress in the detec- 



tion of ealaxv-ealaxv weak lensine fBrainerd et al. 


199a 


Hudson ct al. 


1998; 


Fischer ct al. 2000; Smith et al 


2004 


McKay ot al. 


2001; 


Guzik & Scliak 2002; Hockstra et al 



2003^; .Sheldon et aL,„2004: Hoekstra et al... ..2004,; Seliak et al 
2003), the tangential shear distortion around galaxies due 
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to their dark matter h alos. Recent measurements of galaxy- 
galaxy weak lensing ("S heldon et alj |2004| ; iHoekstra et alJ 
pool; Scliak ct al. 2005) demonstrate 20-30(T detections. In 
light of the increasing statistical precision with which this 
effect is measured, it is important to revisit common sources 
of systematic error, which are currently at the 10 per cent 
level and therefore already dwarf the statistical error. 

While galaxy-galaxy weak lensing is potentially a very 
powerful tool for studying the dark matter halo profiles 
around stacked foreground galaxies, it suffers from a large 
number of potential calibration biases. Because it involves 
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measuring the projected surface density contrast, 

AE(r) = 7+(r)E„ (1) 

averaged over stacked lens and source galaxies (where r is 
the transverse separation from the lens galaxy), calibration 
biases may be introduced via both the tangential shear and 
the redshifts used to compute Ec. 

Here we introduce some of the notation that is used 
in this paper to describe the weak-lensing signal. We com- 
pute the 2-dimensional projected surface density contrast of 
stacked foreground galaxies as a function of transverse sepa- 
ration from those galaxies, where this separation is measured 
in comoving coordinates, using r — rp{l + zi) (with subscript 
I referring to the lens, s to the source), and rp = OisDa{zi), 
the product of the observed angular lens-source separation 
on the sky and the angular diameter distance at the lens 
redshift. We can then measure the surface density contrast, 
which is related to the projected mass density E(r) and its 
average value inside radius r, E(< r), as follows: 

AE(r) = E(<r)-E(r) =7+(r)E, (2) 

The inverse critical surface density E^^ in comoving coor- 
dinates is defined by 

47rG DisDi{l + zif 



Ds 



(3) 



in terms of angular diameter distances, and has the following 
properties: for a given lens redshift, it is zero for < zi, 
then increases rapidly above Zs > z; until it flattens out as 
Zs ^ zi; the asymptotic value for Zs ^ zi increases with zi. 

For a full discussion of the errors that can be intro- 
duced in the shear computation, sec Hirata ct al. (2004 ), 
hereinafter H04. As shown there, several types of error can 
be introduced when computing the shear, including biases 
due to point-spread function (PSF) correction, noise recti- 
fication bias, and selection biases. These biases are consid- 
ered in detail for the linear PSF correction method used 
in that paper. In this paper, we use a different PSF cor- 
rection schem e, "r e-Gaussianization," which was introduced 
and tested in iHira ta & Scliak (2003), and include a discus- 
sion of the effects of that choice on systematic error in the 
shear. 

Because systematic errors in the shear computation 
have been well-studied, and statistical errors can be de- 
creased to a fairly low level when data from large surveys 
such as the SDSS are used, systematic errors in the redshift 
distribution (and consequently, in Ec) are of much greater 
importance than they were previously. For the purposes of 
this paper, we will assume that lens redshifts are known to 
high precision via spectroscopy, so our concern is the sourc e 
redshift distribution; as shown in lKleinheinrich et alJi2004l) . 
if the lens redshifts are also unknown, g-g weak lensing is not 
nearly as powerful a tool. Several common methods of de- 
termining the source redshift distribution are inadequate for 
precision cosmology due to previously unquantified system- 
atic uncertainties that they introduce via their effects on 
EJ^; this paper includes a study of the biases that may be 
introduced using these methods, and a comparison against 
a well-determined reference distribution. 

In addition to biases introduced due to the redshift dis- 
tribution, we also discuss systematics that can be introduced 
while computing the signal. These include biases due to in- 
trinsic alignments, selection effects, and several other effects. 



One new effect not noted before is a problem with the de- 
termination of the sky flux near bright objects by the SDSS 
Photo pipeline that leads to the problem in detection of 
sources within about 90" of bright objects ( i|6.3.7ll . This 
problem affects any galaxy-galaxy or cluster-galaxy weak 
lensing analysis using SDSS data. 

Many of the sources of systematic errors discussed 
above are common to both weak lensing auto-correlation 
analysis and galaxy-weak lensing correlations (galaxy-galaxy 
lensing). Weak lensing auto-correlation analysis at present 
is limited by t he statistical preci sion (see summaries in 
lRefregieill2003bl and lHoekstral2003fl and it is diflicuh to test 
for the presence of systematics within each data set. How- 
ever, the size of these data sets is rapidly increasing, and in 
the near future systematic errors are likely to dominate the 
statistical errors. Understanding of weak lensing systematics 
is essential if one is to exploit the full potential of upcoming 
and planned surveys such as the CFHT Legacy survey 
( http://www.cfht.hawaii.edu/Science/CFHLS McUicr 
2001) , Pa n-St arrs ( ht tp://pan -sta TTS.ifa .hawaii.edu/ , 
Kais er et alj l200l iKaiseJ l2004l). LSST 
{http://www.lsst.OTg/lsst_home.html iTvsonI |200^, 
and SNAP ( http : / /snap.M.gov/ iRhodes et al.l |2004| . 
iMassev et all 120041 iR.efreeier et^ |22^7"someof which 
may reach statistical precision at the 0.1 per cent level. 

Many of our systematic tests 



are done usin; 
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Jjain & Tavlorl 


l2003l. 



dark energy using the fact that AE is an invariant of the 
projected lens mass distribution, and therefore must be the 
same when measured with two different source samples at 
different redshifts. The use of this fact to test the dark en- 
ergy density and equation of state requires control of system- 
atics at the 0.1 per cent level. Since systematics in weak lens- 
ing are currently only constrained at the 10 per cent level, 
we turn this test around to use AE measured with reference 
samples to check for systematic error, knowing that cosmol- 
ogy plays a negligible roll in the comparison. If the source 
samples being compared vary in quantities affecting both 
the shear computation and the redshift distribution (e.g., if 
one sample is more distant, with lower S/N shape measure- 
ment and less well-known redshift distribution), then the 
systematics in both quantities may be different as well, and 
we can only use this method to test the overall calibration of 
the signal rather than the shear calibration, redshift distri- 
bution, and other effects separately. Fortunately, the SDSS 
now covers a large enough area that there is significant sta- 
tistical power for such tests. 

In ^ we describe the data acquisition, selection cri- 
teria, and processing. The common redshift distributions 
used for weak lensing analyses are discussed in |3] includ- 
ing specifically how these methods are implemented in this 
paper. Additional systematics issues introduced in the com- 
putation of the lensing signal are described in [21 Our im- 
plementation of the test for systematics is described in 
and the results are given in !JS| Finally, the implications of 
these tests are discussed in [Q 

Here we note the cosmological model and units used 
in this paper. All computations assume a fiat ACDM uni- 
verse with Q,m = 0.3 and SIa = 0.7. Distances quoted for 
transverse lens-source separation are comoving (rather than 
physical) /i~'^kpc, where Ho = 100/i km/s/Mpc. Likewise, 
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AS is computed using the expression for Sj^ in comoving 
coordinates, Eq. |21 In the units used, Ho scales out of ev- 
erything, so our results are independent of this quantity. All 
confidence intervals in the text and tables are 95 per cent 
confidence level (2(t) unless explicitly noted otherwise. 



2 TECHNICAL APPARATUS 

In this section, we describe the data used for our com- 
putation of the lensing signal. The source of this data is 
the Sloan Digital Sky Survey (SDSS), an ongoing survey 
that will eventually image approximately one quarter of the 
sky (10,000 square degrees). Imaging data is taken in drift- 
scan mode in 5 filters, ugriz, centered at 355, 469, 617, 
748, and 893 nm r espectively (Jukugita ct aL .1996) using 
a wide-field CCD iCunn et alj ll9 9^ After th e computa- 
tion of an astrometric solution jPier et al.ll2003ll . the imag- 
ing data are processed by a sequence of pipelines, collec- 
tively called Photo, that estimate the PSF and sky bright- 
ness, identify objects, and measure their properties. The 
software pipeline and photom etric quality assessment is de- 
scribed in llvezic et al.l ^2004^ . Bright galaxies and other in- 
teresting objects are selected for spectroscopy according to 
specific criteria llEisenstein et alJboOll : IStrauss et aljIiooS 
iRichards et alJl2002h . The SDSS has had four major data 



Lens redshift 



relea ses: the Early Data Release or EDR ( Stoughto^^^l 
| 2002[) . DRl ("Abazaiian et al."200^, DR2 jAbazaiian et al 
|200J), and DR3 ( Abazaiian et al. 200l)- 

While we use imag- 
ing data more up-to-date than DR3, we are limited in area 
by the spectroscopic coverage available to us because spec- 
troscopy lags significantly behind photometry. 



2.1 Lens catalog 

The lens (foreground) galaxies used for this study are 
included in the SD SS main galaxy spectroscopic sample 
JStrauss et alj |20o3), as part of the NYU Value-Added 
Galaxy Catalog CVAGC. iBlanton et al1l2004f) . though the 
version of the VAGC used he re includes mo r e are a than 
the public one described in iBlanton et al.l l|200^. The 
VAGC is use d because of its consistent overall calibration 
jSchleeel et al . 2005a). The sample, after redshift and mag- 
nitude cuts to be described below, includes 259 609 galax- 
ies (decreased from 314 906 after exclusion of the southern 
Galactic region). We only use lenses at redshift z > 0.02 
because of the computational expense of computing pairs 
out to 2 /i~^Mpc for lenses at lower redshifts, and because 
the lower redshift galaxies have low and therefore con- 
tribute little weight. Furthermore, for this study, we only use 
galaxies with r-band Petrosian absolute magnitude — 23 5S 
Mr ^ —17, divided into six magnitude bins, each one mag- 
nitude wide. The spe ctra used were process ed by a separate 
pipeline at Princeton JSchleeel et al.l2005d) . The fiuxes were 
extinc tion-corrected using dust maps from ISchleeel et alJ 
il998h . then fc-corrected to z = 0.1 using kcorrect v1_11 
with values given directly in the VAGC catalog. 

This sample is approximately flux-limited to Petrosian 
apparent magnitude r — 17.77; all absolute magnitudes 
for the lens sample used in this paper are Petrosian r- 
band magnitudes. Redshift-evolution of luminosity consis- 




0.2 
Redshift z 



Figure 1. The redshift distribution for lens galaxies, shown over- 
all and for the 6 luminosity bins. As shown, brighter samples peak 
at higher redshifts. The cutoff at 2 = 0.02 was imposed artificially 
on the sample. Solid points are used to indicate the weighted mean 
redshift of each lens sample. 



Table 1. For each luminosity bin, the number of lens galaxies, 
(z) and cr{z) (a characteristic width, though the distribution is 
not Gaussian), the mean weighted redshift z^ff, and the mean 
weighted luminosity L(,ff relative to L». 



Sample, Air 


ranf 


;c Ngal 


(^) 


a(z) 


^cff 




Ll,[-18,- 


17) 


6 524 


0.032 


0.011 


0.032 


0.080 


L2, [-19,- 


18) 


19 192 


0.048 


0.015 


0.047 


0.20 


L3, [-20,- 


19) 


58 848 


0.074 


0.021 


0.071 


0.49 


L4, [-21,- 


20) 


104 752 


0.11 


0.03 


0.10 


1.2 


L5, [-22,- 


21) 


63 794 


0.16 


0.05 


0.14 


2.5 


L6, [-23,- 


22) 


6 499 


0.22 


0.06 


0.19 


5.6 



tent with lBlanton et all (|2QQ^3) was used, so that the abso- 
lute magnitude used for all cuts was 



Afr(used) = Mr(measured) + 1.6{z ~ 0.1) 



(4) 



The effect of this shift is to include higher luminosity lenses 
with A'/r(mecisured) < — 23 in the brightest luminosity bin 
because their (z) is greater than 0.1, and to include fainter 
lenses in the faintest bin, because their (z) is approximately 
0.03. 

Fig.0shows the lens redshift distribution. The Mr lim- 
its, mean redshifts, and widths of the distribution for the 
6 luminosity bins used here are shown in tabled as is the 
mean effective redshift (taking into account the weights used 
for the computation of signal) and mean eff ective luminos- 
ity rel ative to L* (with M« = —20.44 as in IBlanton et al] 
l2003a^ since they are more relevant for the lensing signal. 
Note that ZcB < (z) because the larger number of pairs for 
lower redshift lenses (due to the larger angular scale asso- 
ciated with the fixed transverse comoving scale) overcomes 
the fact that for a given source redshift increases with 
lens redshift. 
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One important fact about the lens samples L1-L6 is 
that they are flux-limited, not volume limited. As a result, 
since AE is averaged over lens-source pairs with a weight 
proportional to Tt'^'^ {zi, Zs), the mean effective redshift and 
luminosity may vary depending on the redshift distribution 
of the source sample. When comparing AE for a given lens 
sample but different source samples, we explicitly computed 
the mean effective redshift z^ff and luminosity Lcff of each 
lens sample with each source sample (representative values 
are given in table 0. Variations in 1 -I- ZcS and Lcff for the 
same lens sample but different source sample were found 
to be quite small, a maximum of 2 per cent; because this 
variation is so small, and because we lose significantly in 
statistics by going to a volume limited sample, we choose to 
keep the full flux-limited sample and, when necessary, apply 
corrections to the computed AE when comparing between 
different source subsamples. Corrections will be described 
further in g^l 



2.2 Source catalog 

2.2.1 Constructing the catalog 

The source sample consists of galaxies select e d from the 



SPSS photometric catalog llYork et alJ l2000l: iHogg et al 
20011 : IStoughton et alJ l200l ISmith et al.l 120021: IPier et al 



2003 ; l^^azeHiai^^dl 2003r . The catalog contains informa- 



tion about the images from the SDSS camera iGunn et alJ 
Il998l) processed at Princeton by the Photo software 
jLupton et all200ll:lFinkbeiner et al.l2004l) . rerun 137. Note 
that for the source catalog, we use the model magnitudes in 
all 5 bands rather than Petrosian magnitudes, because of 
their higher signal-to-noise for fainter galaxies. 

The source catalog used for this work is not the same 
as that used in H04. Here we describe the catalog used, 
emphasizing differences from the previous one. The most 
trivial difference is the size of the dataset: here we use imag- 
ing data acquired from 1998 September 19 (run 94) through 
2004 June 15 (run 4682), whereas the H04 catalog did not in- 
clude imaging data acquired after 2003 March 10 (run 3712); 
but there are also significant changes in the pipeline. There 
are several steps involved in the development of the cata- 
log: (1) object selection based on Photo outputs, (2) shape 
measurement, and cuts on the shape measurement, (3) other 
cuts, and (4) organization. 

We begin by describing basic object selection start- 
ing from the Photo outputs. Star/galaxy separation 
was accomplished using the Photo pipeline output 
OBJC_TYPE^, and the cut on resolution factor described 
below should further reduce stellar contamination. We defer 
discussion of possible stellar contamination to H4.3I Unlike 
the catalog used for H04, this catalog includes deblended 
child galaxies. Because the deblender has been significantly 



^ OBJC_TYPE classifies objects as "galaxies" if the flux esti- 
mated from the linear combination of de Vaucouleurs and expo- 
nential profiles (composite model magnitude, or cmodel magni- 
tude) fit to the object exceeds the flux estimated from the best-fit 
PSF by at least 0.24 magnitudes. This works because the profile 
fit will pick up more of the light from an extended object than 
the PSF fit. At faint magnitudes (r > 21) this separation scheme 
mistakes some stars as galaxies; see i|4.3l 



improved for DR2, the phenomenon noted for EDR and DRl 
that the deblender so metimes "shreds" large galaxies rarely 
occurs (according to lAbazaiian etlill ijioOJ), inspection of 
several hundred deblends indicates that they are correct 
roughly 95 per cent of the time). While shape measurement 
was performed for all galaxies brighter than magnitude 22.0 
in r-band and 21.6 in i-band (no requirements on detection 
in g-band), these cuts were applied using model magnitudes 
before the extinction correction. Several other cuts on the 
Photo flags were performed: the galaxy must have been de- 
tected in unbinned images in the r and i bands; also, several 
flags indicating problems in shape measurement or problems 
with the image (e.g., interpolated pixels) must not have been 
set. 

Next, we describe the shape measurement determina- 
tion. The PSF-correction algorithm used for this work was 
the "r e-Gaussianization" scheme described and tested in 
iHir^^ fc Seliak (2003). Recent SDSS lensing works, includ- 
ing ISheldon et al.l ll2004h and H04, have used the linear 
schem e described there, but as shown in llfirata fc SeliakI 
i2003h . re-Gaussianization is much more successful at avoid- 
ing various shear calibration problems, reducing them to the 
several per cent level (rather than ~10 per cent) even for 
poorly-resolved galaxies. Unlike the linear scheme, which in- 
volves correcting the measured adaptive moments of the im- 
age by factors involving the adaptive moments of the PSF, 
re-Gaussianization involves fitting the PSF shape to a Gaus- 
sian, and using the deviations of the PSF from Gaussianity 
in the PSF correction. The re-Gaussianization method was 
implemented by reading the atlas im ages and the PSF maps 
from Photo (Stoughto n et al]l2002l) . since it is impossible 
to implement using the object catalogs alone. 

Re-Gaussianization is a perturbative PSF correction 
scheme based on the observation that if the PSF P and 
the pre-seeing galaxy image / are Gaussians, and have co- 
variance matrices M'^' and M'-'^', then the observed image 
I — P igi f (here ® represents two-dimensional convolution) 
is a Gaussian of covariance M(/)-^M(-P'. A simple PSF cor- 
rection scheme is thus to find the covariance matrices M'^^ 
of the PSF and M^^-* of the observed galaxy image, and es- 
timate 



M 



{/) 



M 



(I) 



M 



(5) 



In practice, galaxy shapes are not perfectly Gaussian but 
one can fix this by finding A, xi, and M*-^' that minimizes 



\l{x) ~ Ae 



(6) 



The covariance matrix M'^' so obtained is known as the 
"adaptive" covariance matrix, and its trace T^^-* is known as 
the "adaptive" trace. In principle one can evaluate Eq. 
and then estimate the galaxy ellipticity 



(/) Jf)\ 



(7) 



In practice, Eq. Q does not work very well for real PSFs 
and galaxies - both PSF and galaxy tend to have sharper 
central peaks and wider tails than Gaussians with the adap- 
tive covariance - and thus two correction s are m ade. The 
first correction, due to lBernstein fc JarvisI i2002l) . accounts 
for the non-Gaussianity of the galaxy. If the PSF is circular. 
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Eq. ||7J reduces to 



J/) 



(I) 



(8) 



where 7?, 



(.1) 



1 - is the resolution factor. 



iBernstein fc Jarvij ll2002h worked to first order in T^^' for 
a non-Gaussian galaxy, and found that Eq. still applies, 
but with R^^ replaced by the non-Gaussian resolution factor 



(9) 



where a'P is the dimensionless kurtosis of the galaxy, defi ned 
to be zero for a Gaussian (see IBernstein fc Jarvi 3 l2002l for 
a precise definition). This equation can be generalized to an 
elliptical PSF by requiring SL(2, R) shear invariance, and is 
found to work well for Gaussian PSFs in "toy" simulations 
jHirata fc Seliakll2003^ . 

The second correction required is for the non- 
Gaussianity of the PSF. This correction begins by find- 
ing the Gaussian G{x) that best fits the PSF P{x) ac- 
cording the unweighted least-squares method, i.e. minimiz- 
ing J \G — d^x. It then re-normalizes G to integrate to 
unity - a condition not always satisfied by the best-fit Gaus- 
sian even though f P{x)d^x = 1 - and finds the residual 
e{x) = P{x) — G{x). Next the pre-seeing image of the galaxy 
is approximated by a Gaussian fo{x) whose covariance ma- 
trix M'^°' is obtained by subtracting the adaptive covariance 
matrices M'^"' = M'^' — M'^', and a "re-Gaussianized" im- 
age I' — I — e (gi fo is constructed, which is supposed to 
approximate what would have been observed had the PSF 
been Gaussian. The IBernstein fc JarvisI (jioO^) prescription 
is then applied to /' using i?2^ '(NG). 

The re-Gaussianization scheme is exact to first order in 
PSF non-Gaussianity, however higher-order approaches have 
been proposed based on expansions of the gal axy and PSF 
in or thogonal functions (Refrcgicr 2003a; Berns tein fc Jarvi3 
[2002). Suggestions include direct fitting of the convolved 
galaxy to the data iBernstein fc Jarvi!j|20o3l or deconvolu- 
tion reg ularized by a cutoff in th e orthogonal function ex- 
pansion llRefregier fc Baconlliooll . These methods have not 
yet come into general use, but are likely to become more 
widely used in the future due to the demanding calibra- 
tion requirements of cosmic shear surveys. However, we note 
that galaxy-galaxy lensing is a promising "testing ground" 
for these methods since the same systematics tests that we 
use to test redshift distributions in JS] could also be used to 
study the relative calibrations of the various PSF correction 
methods. Such tests could be done independent of redshift 
distribution information by using the same sets of sources 
for a comparison of the shear jt computed from ellipticities 
determined by each PSF-correction method. 

Only galaxies passing certain cuts on the shape mea- 
surement were included in the catalog. The shape mea- 
surements used were the average of those in the r and 
i bands; the u, g, and z bands were not used because 
their lower signal-to-noise did not justify the large compu- 
tational expense of performing the re-Gaussianization. To 
eliminate galaxies that may cause large noise-rectification 
bias, to avoid the untrustworthy results of PSF correction 



when the galaxy is unresolved, and to help with star-galaxy 
separation, we only include galaxies with resolution factor 
R2 > 1/3; R2 here is defined as 



i?2 



(10) 



where t'^^ is the trace of the moment matrix for the PSF 
and r'^ -* is that quantity for the re-Gaussianized galaxy 
image. Note that this cut is only applied in the bands used 
for shape measurement, and since we only attempt shape 
measurement in r and i, it is possible that R2 < 1/3 in 
the other bands (indeed, the object may not even be visible 
in u, g, or z). We require that the galaxy pass this cut in 
both bands, since if we only require it in one band, then we 
can create a selection bias by preferentially using the shape 
measurement that has R2 > l/?> over that with R2 < 1/3. 

Once shape measurement was complete, several other 
types of cuts were applied. To ensure a relatively uniform 
source sample across the survey area and to avoid regions 
near the Galactic plane, only regions in which the extinction 
was lower than 0.2 magnitudes in r-band wer e used. The ex- 
tinction was determined using dust maps from lSchleeel et al] 
( 1998.) . To ensure a uniform sample, we also require r < 21.8 
(extinction-corrected) . 

The shape error estimates are in principle used for 
three purposes: weighting, determination of the shear re- 
sponsivity, and determination of the error bars on final quan- 
tities such as AE. We obtain the errors due to Poisson fluc- 
tuations in the sky and CCD dark current via the simple for- 
mula (appropriate for Gaussians; IBernstein fc JarvisI 1200^ 



CTe (sky + dark) 



R2F 



(11) 



where n is the sky and dark current brightness in photons 
per pixel, cr'^' * = det M'^' is the size of the galaxy in pixels, 
and F is the flux. This equation is crude, but for the pur- 
poses of weighting there is no need for high accuracy, and the 
error bars on our results are computed via analytic, random 
catalog, and bootstrap methods that depend on the actual 
dispersion of the ellipticities, including shape noise, rather 
than CTe itself. (The responsivity determination is addressed 
in H2.2.2I ') There is also a contribution to the shape mea- 
surement uncertainty due to Poisson noise from the galaxy 
itself; this is given by (again for Gaussians) 



cre(gal) = 7?2 



64 

27iVe 



(12) 



where Ne is the number of photoelectrons from the galaxy. 
This is only signiflcant for galaxies bright compared to the 
sky; the typical sky brightness is 21.0 mag arcsec"^ in r 
band and 20.3 mag arcsec"^ in i, and even a poorly resolved 
(i?2 = 1/3) galaxy usually has a full width at half maximum 
of ~ 1.7 arcsec after seeing, so sky brightness dominates for 
r 20.0 and i ^ 19.3. Since the shape noise is dominant 
over measurement noise for the brighter objects, we have 
not included the galaxy noise (Ea. ll2ll in our weighting, nor 
has it been included in the adaptive moment errors from 
Photo. 

Some additional cuts were designed to eliminate regions 
with faulty data. These cuts eliminated less than 1 per cent 
of the data total. First, the mean ellipticities (ei) and (62), 
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and their rms deviations, were computed on a run/camcol 
basis. Those few run/camcols that had |(e)| > 0.05 (for ei- 
ther ellipticity component, in either band) were excluded 
from the analysis; visual inspection of several of those runs 
showed severe PSF anisotropy (despite a reasonable PSF 
FWHM) for which our correction scheme was unable to ac- 
count. Furthermore, based on the distribution in rms el- 
lipticities, those run/camcols with values less than 0.38 or 
greater than 0.52 were excluded (the mean value was 0.45, 
higher than the expected shape noise since all galaxies, 
even those that had significant measurement error, were 
included). Those with rms ellipticities above the accept- 
able range typically were imaged in particularly poor seeing, 
which led to greater noise in the PSF-corrected ellipticities. 
Within runs that were accepted, galaxies with total elliptic- 
ity = -1-62 > 4 were rejected. Finally, those galaxies in a 
small region (~ 4 square degrees) that had faulty astrometry 
were eliminated. 

Once these cuts were applied, a few final steps were 
necessary to make the catalog useful. In the case of multi- 
ple observations of the same galaxy, only one observation 
was used, that which was taken in better seeing. The shape 
measurements in the two bands were combined, weighting 
by the (S/N)'^ of the detection in each band. 

Unlike in H04, photometric redshifts were assigned to 
all extinction-corrected r < 21 galaxies using a tem plate- 
based program kphotoz v3_2 l|Blanton et al J '2003c''). The 
performance of these photometric redshifts will be described 
in H6.ll Approximately 4 per cent of galaxies with r < 21 
had failed photometric redshift determination, and so were 
not used for any analysis requiring the use of photometric 
redshifts. 

For reference, we include here some plots showing in- 
formation about the source sample (these plots also show 
information about the high-redshift Luminous Red Galaxy, 
or LRG sample, a subsample of the source catalog, that will 
be described in more detail in H3.4t . The magnitude distri- 
bution of sources in the catalog is shown in figure|21 The rms 
ellipticity as a function of magnitude is shown in figure El 
The plot of the average R2 as a function of magnitude, and 
of the overall distribution of R2 values, is in Fig. 21 Some 
general information about the catalog is included in table |5| 
Note that for all tests performed in this paper, we divide the 
sources into three samples: r < 21, r > 21, and high-redshift 
LRGs. 



2.2.2 Shear calibration bias 

Here, we list the sources of shear calibration bias that were 
described in detail in H04, and estimate their magnitudes for 
the source catalog used here; refer to that paper for more 
detail about estimated shear calibration uncertainty. 

There are five major sources of shear calibration bias, 
as listed in H04. First, we consider the PSF dilution cor- 
rection, the correction to the measured galaxy image to ac- 
count for the blurring due to convolution with the PSF. Un- 
like the linear PSF-correction method using in H04, which 
has significant shear calibration uncertainty due to this ef- 
fect for the less well-resolved galaxies, the re-Gaussianization 
method only has a few per cent shear calibration uncertainty 
jHirata fc Seliakll2003^ even for the lower limit R2 = 1/3 
studied in that paper. For this paper, a plot of the PSF 



Table 2. Source catalog properties. Note that only about 65 per 
cent of the sources are in regions with spectroscopic coverage, 
and hence only these are used in computing AS. Also, because we 
require a successful measurement in both bands, only the number 
measured in both r and i is relevant. 



OKy coveidge, j sky 




0.176 


Successful measurements 

[ all rr^^lavir^Qi 


r or i 

i band 
r and i 


39 436 326 

35 344 893 
31 697 869 


Successful measurements 
(r and i) 


r < 21 
21 < r < 21.8 


18 709 472 
12 988 397 


Source density 
(all galaxies) 


r or i 
r and i 


1.51 arcmin"^ 
1.21 arcmin"'^ 


T» 1 J_ ■ IT J / 7~> \ 

Hesolution lactor {H2) 
(mean ± std. deviation) 


r band 
i band 


0.61 ± 0.15 
0.60 ±0.16 


Mean magnitude 
(extinction corrected) 


(r) 
(i) 


zu.uo 
20.22 


High-redshift LRGs, r and i 
successful measurements 
source density 
mean redshift 
mean resolution factor 

mean magnitude 


(R2{i)) 
(r) 

(i) 


2 884 242 
0.11 arcmin^^ 
0.55 
0.58 
0.55 
20.93 
20.03 


r model magnitude distribution (source sarr 


pie) 




r model magnitude 

Figure 2. The r-model magnitude distribution for the source 
catalog, shown for the full catalog (solid) and high-redshift LRGs 
(dotted line). The turnover at faint magnitudes occurs because 
we lose many sources at the faint end due to our selection criteria 
for the shape measurement (e.g., the cut on -R2). 
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PSF dilution correction 




_5 I , , , 1 , 1 _5 I , , , 1 , 1 

0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 

Resolution factor Rg Resolution factor Rg 



Figure 5. PSF dilution correction for tlie two types of galaxy profiles, as a function of R2, for different values of source cllipticity (with 
our cut at R2 = 1/3 shown as a vertical dotted line). For the de Vaucouleurs profile, the usual Gaussian relation R2 = 1/(1 + (t(^) ^ /a^^^ ^) 
breaks down - the Gaussian ct^^) ^ fits the central cusp of the de Vaucouleurs profile and misses much of the light. This is why the values 
for, e.g., R2 = 1/3 shown here disagree with the ct'^' ^/ct'^' ^ = 2.0 curves in lHirata fc SeliakI l2003l) . 




Figure 3. The rms ellipticity (for a single cllipticity component, 
averaged over both bands) as a function of magnitude, shown for 
all sources, and for the high-redshift LRG sample. We show the 
results both with and without measurement noise (including noise 
from the galaxy itself), as labeled. The increase in the result with 
noise at faint magnitudes shows the effects of noise in the shape 
measurement, but it seems that the increase in the result without 
noise at faint magnitudes may indicate a real trend in the rms 
ellipticities of the galaxies with magnitude. 



dilution correction as a function of R2, for both exponen- 
tial and de Vaucouleurs profiles, for various values of source 
ellipticity, is shown in Fig. |3 

We model the calibration error due to the PSF dilution 
correction as due to a sum of contributions from exponential 
profile and de Vaucouluers profile galaxies, 

t) ■ 

exp ^ ' / deV 

where fexp and fdev are the weighted fractions of the two 
types of galaxies with fexp + fdeV = 1. A lower bound on 



(57 _ / (57 



+ fd. 



Figure 4. The top plot shows the average R2 value for the 
source catalog as a function of magnitude for all sources (solid) 
and high-redshift LRGs (dashed). As expected, {R2) is higher for 
brighter galaxies, indicating that their shapes are better-resolved. 
The bottom plot shows the distribution of R2 values. The plot 
is shown for the r-band R2; results are nearly identical for the 
i-band R2. 



57/7 can be obtained from Fig. |S]by noting that ^7/7 ^ 
—0.014 for the exponential profile and Sj/^ ^ —0.035 for 
the de Vaucouleurs profile, hence 



^ ^ -0.014/,,p - 0.035/deV. 

7 



(14) 



For the upper bound, we repeat this calculation, except that 
we use the S'y/'y corresponding to R2 — 1/3 and elliptic- 
ity equal to \f2ermB (since this is the rms total ellipticity). 
This calculation is conservative since most galaxies are at 
Ri > 1/3 where ^7/7 is less (at fixed e). (We find that R2 
and are almost completely uncorrelated after the noise 
(Te is subtracted from for each of our three samples, and 
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for both the de Vaucouluers and exponential sources within 
each of these samples.) The results are shown in Table |21 
these should be interpreted as 2a bounds, since it is likely 
that some cancellation between positive and negative dilu- 
tion galaxies occurs. 

Next, we consider errors in PSF reconstruction, which 
can arise if the PSF ellipticity or trace are misestimated by 
the Photo PSF pipeline. This error was considered by H04, 
which showed that the PSF trace is accurately reconstructed 
to within |(5r(^Vr'^'| < 0.03. We can use Eq. (20) of H04 
to estimate the resulting calibration error; we find weighted 
averages {R^^) = 1.71, 1.81, and 1.90 for the r < 21, r > 
21, and LRG samples'^ respectively, resulting in calibration 
errors of ±2.1 (r < 21), ±2.4 (r > 21), and ±2.5 per cent 
(LRG). 

We also must be concerned about shear selection bias, 
the preferential selection of galaxies at low or high elliptic- 
ity. Considering that figure El shows clear evidence for evo- 
lution of Crma wlth magnitude, we do not estimate selection 
bias using the method from §3.2.3 of H04, which assumes 
no evolution of rms ellipticity with magnitude. An alterna- 
tive method of estimating the shear selection bias utilizes a 
simple model of the selection criteria. This method is not 
dependent on assumptions about the evolution of the ellip- 
ticity distributions. The main selection criterion that can be 
influenced by shear is the resolution factor cut, which favors 
highly elongated galaxies, since these have a larger trace T'^' 
after PSF convolution than a circular galaxy with the same 
area. We model this by noting that for a Gaussian galaxy 
and PSF, the resolution factor obeys 



i?2 = 1 - 



r('P)±2o-(«Vl-e(/)2' 



(15) 



where 



V det M'^' and e'^^ is the ellipticity of the pre- 
seeing galaxy image. A gravitational shear along the a;-axis 
leaves cr^-^^^ fixed but changes e'^' according to 



Ae<^' = 2(1 



Af) 



J/)2 



(16) 



where 7 is the amount of the shear. Therefore the change in 
resolution factor is 



AR2 = 



dR2 



^g(/) ^ 2e^^>R2{l-R2h-r7T- 



,(/) 



(17) 



Now the effect of the _R2 > 1/3 cutoff on the mean ellipticity 
can be estimated by averaging the ellipticities of the galaxies 
that are accepted into the catalog because of the shear (the 
integrand is negative when galaxies are removed or their 



<0): 



dn 



de<^' 
de(/)dii2 

,min {I-R2 



fl2=0.33 •'O 



d(j) 



if) 



Xf), 



l(-R2,min), 



(18) 



2 The {-Rj ^> values are the same for the r and i bands to within 
±0.01 for the r < 21 and r > 21 samples. For the LRG sample, 
we find {R2 ) = 1-83 in r and 1.90 in i; we quote the i value here 
because the signal-to-noise ratio for the LRGs is typically greater 
in i and hence this measurement is weighted more heavily. This 
is certainly conservative as the error increases with increasing 
(i?2"'>- 



where (p^^^ is the position angle^ and dn/de^^^dR2 is the 
joint ellipticity-resolution factor distribution. In the last line 
we have approximated e^^^ and R2 as independent, which 
we have found to be very nearly true, and noted that the 
mean value of e^^^^ is 2e™s because the ellipticity has 2 
components. The shear calibration error due to selection at 
the i?2 cut, assuming that all galaxies are weighted equally, 
is 



Sj _ -R2,m in(l 

7 



i?2,, 



7^ 



sn(i?2 



(19) 



where TZ is the shear responsivity which is discussed further 
in subsection 12.31 Note that n(-R2,min) cannot be estimated 
from Fig. 21 because that plot shows the distribution of R2 
values in the r-band. For this calculation, the relevant quan- 
tity is the distribution of R2 values formed by choosing (for 
each source) the lower of the two R2 values, since that num- 
ber is what determines whether the object is included in our 
catalog. Furthermore, we must use the distribution of R2 val- 
ues weighted by the weights used in our lensing analysis. For 
a typical value of TZ for each sample from tl2.3l our cut value 
i?2,min = 1/3, aud Weighted values n(i?2,min) = 1.6, 2.4, and 
2.8 for the r < 21, r > 21, and LRG samples, respectively, 
we obtain a selection bias estimate d'y/^ = 0.057, 0.103, and 
0.111 for r < 21, r > 21, and LRGs. 

However, the R2 cut is not the only one that will 
cause shear selection bias; the detection requirement that 
S/N = v > 5 will lead to selection bias in the opposite di- 
rection, though the magnitude of the effect is not as great 
(see Appendix 121 which shows the calculation of the esti- 
mate as ^7/7 = —0.036, —0.066, and —0.037 for the three 
samples respectively). Consequently, we consider the 2a es- 
timate of shear selection bias to be as low as zero and as 
high as the values estimated only taking into account the 
R2 selection. 

Shear responsivity error is an error in the shear respon- 
sivity via a systematic uncertainty in 6, 7715. We use a value 
of er77is(mag) estimated from figure with different results 
used for the full source catalog and for the high-redshift LRG 
sample. (Note that in light of our shear selection bias results, 
we may consider that the increase of erTTis with magnitude 
is in part due to shear selection bias, since the average R2 is 
lower at fainter magnitudes, and therefore the selection bias 
should be more severe there. Even if this is true, it is still 
correct to use the value of Crms (mag) when computing the 
shear responsivity.) We estimate the systematic error in Crms 
assuming that the uncertainty in a^ is its primary source of 
uncertainty. To estimate uncertainty in a^, we looked at the 
southern galactic survey area, for which there are many re- 
peat observations of the same area (as many as 27 for some 
areas) that can be used to get empirical values of a^ that 
can be compared against the theoretical value derived from 
Egs. inl and Egs. 1121 We find that Crms is overestimated by 
about 0.010 {2a confidence interval [0.0,0.02]), yielding an 
estimate of shear calibration bias of [0.0, 0.017] according to 
Eq. 25 in H04. 

The final major source of shear calibration bias is noise- 
rectification bias, whereby the noise in the image leads to a 
bias in the ellipticity due to the non-linearity of the PSF 



3 Defined by e^' ± ie'/^ 
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Table 3. Shear calibration biases and other parameters for the various source samples, at the 2a level. 



Sample 


r < 21 


r > 21 


LRG 


Jexp 


0.59 


0.58 


0.33 




0.39 


0.41 


0.41 


firms (deV) 


0.38 


0.42 


0.37 


Calibration bias (per cent) 








PSF dilution 


[-2.2, +2.9] 


[-2.2, +4.0] 


[-2.8, +3.9] 


PSF reconstruction 


±2.1 


±2.4 


±2.5 


Selection bias 


[0,5.7] 


[0, 10.3] 


[0,11.1] 


Shear responsivity error 


[0.0, 1.7] 


[0.0,1.7] 


[0.0,1.7] 


Noise rectification 


[-1.0,0] 


[-3.8,0] 


[-1.2,0] 


Total 2(7 57/7 (per cent) 


[-5, +12] 


[-8, +18] 


[-6, +19] 



correction process. As shown in H04, equation 26-27 and 
Appendix C, the noise-rectification bias can be estimated as 



^ ^ Knv~^ = 4(1 - 3i?7^ + Ro^ + 2el^,)p-^ (20) 
7 



where v is the signal to noise of the detection averaged over 
bands: 



(21) 



For high-i?2 galaxies, Kn ~ —2.7, decreasing to —3.7 for 
R2 = 2/3 and then increasing to 5.3 at our i?2,min = 1/3 
(and rising rapidly at lower R2, as high as 21 for R2 = 1/4). 
For each source sample, we compute the weighted average 
value of Knv~^, to find noise-rectification bias of —0.005 
(r < 21), -0.019 (r > 21), and -0.006 (LRG). To estimate 
the 2cr error, we consider the allowed range of the noise- 
rectification bias to be equal to the magnitude of the error 
estimated above; results are shown in table |3 

Those five effects are the major sources of shear calibra- 
tion bias; there are also several minor sources, at the 0.1 per 
cent level. These include camera shear (for which we correct 
using the astrometric solutions, as described in H2.2.H . er- 
rors due to pixelization, and atmospheric refraction effects. 
We do not attempt to estimate values for these subdominant 
sources of error. The total shear calibration bias (at the 2a 
level) with the five main sources of error taken into account 
is shown at the bottom of table El for the three source sam- 
ples individually. These estimates are conservative in that 
they do not assume any distribution for these errors, allow- 
ing the actual values to add, rather than adding them in 
quadrature (which assumes some possible cancellation). 



2.3 Shear estimator 

The weighting used for this work differs from that of H04 in 
two ways. First, rather than the uniform weighting used in 
that paper, we weight by measurement error: 



equation 111! This weight is then multiplied by E^^, down- 
weighting lower redshift lenses and lens-source pairs with 
small redshift separation relative to those at large sepa- 
ration. Consequently, the weight used for a given pair is 
wi^s = '!iisE~^(zi, Zs). The shear responsivity TZ appropri- 
ate for this weighting sche me is th en computed using equa- 
tions (5-33) and (5-35) from lBernst ein & Jarvis (2002), with 
the average value for our source samples being 0.86 for the 
r < 21 sources, 0.83 for the r > 21 sources, and 0.85 for 
LRGs. 

Using these weights, the shear estimator is then 



AE = 



(23) 



(22) 



While we also tried an ellipticity -dependent weight as 
suggested in lBernstein fc Jarvi?J l)2002h to increase signal-to- 
noise, we found it had minimal effect on the errorbars, so all 
the work in this paper was done with the weighting scheme 
described above. 

2.4 Error determination 

Several methods of determining the errors on AE were used, 
each with its own advantages and shortcomings. We describe 
them here, and in t|6.2l we compare the results in order to 
determine on which to rely. 



2.4-1 Analytic computation 

Analytic expressions for AE for a giv en weight function 
may b e derived from equation (5-27) in lBernstein fc Jarvia 
Ii2002h . This method is the least computationally expensive 
method of deriving errors, but suffers from several short- 
comings. First, it gives incorrect results in the presence of 
spurious shear power in the source catalog. Second, it does 
not allow an easy way to include errors on the boost factors, 
which may be significant. Finally, it does not account for 
correlation of radial bins, which can be significant at large 
radius, where the average lens-source separation is larger 
than the average lens-lens separation, so a given source con- 
tributes to the measurement in several radial bins. 



where asN, the rms shape noise in one ellipticity compo- 
nent, was determined as a function of r model magnitude 
from Fig. 121 for the full source catalog and LRGs separately, 
and (Te is the error per component on the ellipticity from 



2.4-2 Random catalogs 

A more computationally expensive way of determining the 
errors is using random lens catalogs. In the absence of sys- 
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tematic shear, the average signal around random points 
should be zero, and the rms deviation around the mean gives 
some measure of the noisiness of the signal, and therefore of 
the errors when using the real lens catalog. However, as de- 
scribed in H04, this method is only valid on small distance 
scales, less than about 500 /i~^kpc for the faint lens samples 
and around 1 ft~'^Mpc for the brighter samples. Further- 
more, this method also cannot take into account the errors 
on the boost factors, though it can account for the correla- 
tion of radial bins. To get a reasonably smooth measure of 
the errors, a large number of random catalogs must be used; 
for this work, we used 24. 

2.4-3 Bootstrap resampling 

As described in H04, bootstrap resampling is a useful 
method of determining the errors. For this purpose, the lens 
catalog is divided up into 200 subregions, and the signal 
is computed for each subregion. The bootstrap-resampled 
datasets are generated by combining the signal from 200 
subregions with replacement. Then, a large number of re- 
samplings (for this work, 2500) can be used to determine 
the average signal and its error. This method has several 
advantages over the other two; for example, it naturally in- 
corporates errors in the boost factor, and the correlation of 
radial bins. However, because of the finite size of the sub- 
regions, the errors are once again not trustworthy at large 
lens-source separation. Also, as described in H04, the noise 
in the covariance matrix means that the values for fits 
performed using the covariance matrix do not follow a 
distribution. 



3 REDSHIFT DISTRIBUTIONS 

In this section, we first describe the many commonly-used 
methods of source redshift determination, including poten- 
tial errors. Then, we describe the "reference" redshift distri- 
butions that we use for calibration. 

3.1 Photometric redshifts 

Surveys that collect photometric information in several pass- 
bands allow for the determination of photometric redshifts, 
which use the galaxy colors to extract an approximate red- 
shift, typically based on the construction of galaxy spectral 
energy distribution (SED) templates that are evolved with 
redshift. In principle, these photometric redshifts may be 
used directly in the computation of E~^. To be accurate, 
this computation should take into account the photomet- 
ric redshift error distribution, which may be highly non- 
Gaussian, particularly in certain redshift ranges and areas 
of color space. 

We studied photometric redshifts computed by two in- 
dependent groups for the SDSS, and found that they per- 
formed statistically nearly identically (that is, photomet- 
ric redshifts for particular galaxies were not necessarily 
the same, but the mean bias and scatter were nearly the 
same as a function of magnitude and of photometric red- 
shift). One set of photometric redshifts was computed using 
the program kphotoz as mentioned in H2.2I The other set 
was computed by the SDSS photometric redshift working 



group (as dis cussed in ICsabai et ahlbOOl based on work in 
ICsabai et a'i]|2000l and lBudavari et alJl200ol) . for DRl data 
only. Because of the lack of photometric redshifts for the full 
DR2 sample, instead of using the nearest-neighbor search 
method from H04 to get photometric redshifts for the full 
sample, we demonstrate results in this paper using redshifts 
from KPHOTOZ for the full sample. 

We only use photometric redshifts for sources at r < 21, 
because of the large scatter at fainter magnitudes that will 
be demonstrated in i|6.1.2l The mean redshift of the source 
sample at r < 21 is ~0.35, with a fairly large width. This fact 
raises several concerns for use with the lens sample, which 
extends out towards the peak of this redshift distribution 
at the bright end. Lenses at higher redshift are weighted 
more highly because of their higher so that if many of 
the lens-source pairs are at small redshift separation, and 
there is a bias on the photometric redshifts, then even with 
the weighting which downweights nearby pairs, we still 
end up with a large bias in AE. Consequently, the pho- 
tometric redshift error distribution, which can be difficult 
to determine, is very important. Previous studies used the 
spectroscopic sample, which is brighter, with much lower 
photometric errors, supplemented b y the CN0C2 spectro- 
scopic redshifts down to r = 21 tC sabai et al.ll20o3) . or used 
stacked images from the southern SDSS survey due to their 
lower photometric errors (for kphotoz*). Consequently, the 
applicability of the photometric redshift errors from these 
studies to a full catalog based on fainter single images (that 
will have additional photometric redshift error due to noise) 
is questionable. 

Fortunately, as will be shown in H3.3I we now have the 
capability of studying the errors in photometric redshifts di- 
rectly for a representative subsection of our source sample 
rather than for atypical brighter or noiseless samples, and 
the application of these error distributions in the computa- 
tion of the lensing signal can help eliminate the bias in AE 
due to photometric redshift error. 

3.2 COMBO-17 distribution 

Another commonly-used option for the source redshifts is 
the use of a probability distribution rather than individual 
redshifts. This method has the disadvantage of leading to a 
larger boost factor due to the inability to weed out phys- 
ically associated pairs. However, for the r > 21 galaxies, 
distributions are a better option than photometric redshifts 
due to photometric noise; consequently, we do not consider 
photometric redshifts for r > 21. 

The redshift distribution used for this paper is derived 
from data from the Classifying Objects by Medium-Band 
Observations, or COMBO-17 survey. COMBO-17 includes 
photometry in 17 passbands spanning the wavelength range 
from 350-930 nm, yielding far more information for the de- 
termination of photometric redshifts than the SDSS. The 
area of the survey used in the study that derived the redshift 
distribution covers 0.78 square degrees, spread over three 
disjoint regions (smaller than the full survey), so while the 
area is small, the concern about the derived redshift distribu- 
tion p{z\r) being unduly infiuenced by large-scale structure 

* Michael Blanton, private communication. 
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is somewhat lessened. IWolf et alJ i2003h includes the lumi- 
nosity functions upon which these distributions were based. 
Since the distributions are for all photometric galaxies rather 
than those passing our lensing cuts, we may expect that they 
lie at slightly higher redshift than our catalog on average; 
we show results of tests of this hypothesis in H5.4.2I A plot 
of this distribution (averaged over r) is shown in comparison 
with other distributions in Fig. |7| 

Note that the COMBO-17 photometric redshifts have 
been used both directly for cosmic shear studies in the 
COMB O-17 survey itself llBrown et alj2003tlHevmans et all 
and to estimate redshift distributions for other cos- 
mic shear investigations (iHevmans et al ] |2004bl) . 

3.3 DEEP2 

Another way to determine the true redshift distribution for 
our sources is to find another survey tha t is flux-limited 
and co mplete to a desired flux. As shown in llshak fc Hirat'al 
i2005(l in the context of cosmic shear surveys, even just 100 
spectroscopic redshifts may be sufficient to m ake this de- 
term i nation. Fortunately, th e DEEP2 survev IID avi^^^ 
| 2003l. iMadgwick et al.1 1200A iDavis et alj I2004L ICoil et al, 
l2004a^ provides results that are useful for this purpose, with 
spectroscopic completeness well beyond r = 21.8, the limits 
of our source catalog. The DEEP2 survey will eventually in- 
clude spectroscopy of ~60 000 galaxies in 4 fields totaling 3.5 
square degrees. While the targeting in three fields involves 
the use of photometric information to select galaxies with 
z > 0.7, the targeting in the extended Groth strip (EGS) 
does not attempt to place such restrictions, and because 
it overlaps with the SDSS, it may be used to study redshift 
distributions and photometric errors in SDSS data. Observa- 
tions are complete in pointing one of field one (EGS), centred 
at Dec. -1-52° 12' and RA 14''15"'.7, which has area approx- 
imately 0.15 deg^ (roughly 1/4 the area in the fuU EGS). 
The Groth strip is situated > 50 degrees from the nearest 
of the three COMBO-17 fields, so the redshift distribution 
obtained from it can be considered statistically independent 
from the COMBO-17 results. The detectors on the Canada- 
France-Hawaii Telescope (used for imaging for DEEP2 tar- 
get selection) saturate at Rab ~ 17.6 at the bright end, so 
no galaxies brighter than that limit have spectra, but fortu- 
nately those galaxies constitute a very small fraction (~ 2 
per cent) of the source sample. Also, about 1 /3 of the galax- 
ies in the Groth strip were not targeted at all, which reduces 
the number of potential matches against SDSS. 

The target selection in the Grot h strip did invo lve some 
color and magnitude information (iFaber et al.. i2005il . At 
R < 21.5, all galaxies are selected uniformly; fortunately, 
the majority of the galaxies in our lensing catalog fall into 
this category. At fainter R, galaxies are classified as low or 
high redshift via color cuts; low redshift galaxies are down- 
weighted significantly, which makes the selection fairly com- 
plicated. To verify that this selection has a negligible effect 
on the results in this paper, all redshift distributions com- 
puted using DEEP2 data were recomputed taking into ac- 
count selection probability (i.e., weighting each redshift by 
1/p). The value of {E~^) for various values of lens redshift 
and with the weighted redshift distributions were computed 
and compared to the values from the unweighted distribu- 
tions. Even for lens redshift ^ 0.2, which is at the high 



end of the distribution, and should be more sensitive to the 
source redshift distributions than most other lenses, the frac- 
tional change in the value of was on the order of a few 
tenths of a per cent. Consequently, we consider the selection 
function to be of negligible importance for the distributions 
presented in this paper, and for the remainder of this work 
we use the unweighted results. 

To use the DEEP2 redshifts to compute redshift dis- 
tributions and photometric redshift error distributions for 
our catalog, we first matched between the DEEP2 spectro- 
scopic catalog in the Groth strip and our lensing catalog. 
This step ensures that all distributions that we derive will 
apply to lensing-selected galaxies, which are expected to be 
at lower average redshift than all galaxies at the same mag- 
nitude, due to our requirements on the shape measurement. 
In principle we could have used re dshift distr i butions as a 
function of magnitude presented in ICoil et alJ J2004bl) . but 
since those were derived for all galaxies, they are expected to 
be at slightly higher average redshift than the distributions 
for lensing-selected galaxies. Once matching was complete, 
there were 278 matches, 162 at r < 21 and 116 at r > 21. 
Our requirement that there be a high-quality redshift had 
eliminated 33 potential matches, giving a redshift determi- 
nation success rate of 89 per cent for lensing-selected galax- 
ies overall, or 91 per cent for r < 21 and 86 per cent for 
r > 21. 

We must be concerned about the effects of redshift fail- 
ures on our results. The lack of knowledge whether or not the 
failures lie in a particular region of redshift space (e.g. higher 
redshift on average) introduces an unknown syst ematic into 
our res ults. First, we note that as discussed in ICoil et all 
l2004aj), redshifts z > 1.45 cannot be measured by the 
DEEP2 survey. However, for the magnitude ranges of inter- 
est in this paper, this limit is effectively of no importance. 
Second, we find that the fraction of matches with failed red- 
shift determination varies somewhat with magnitude (6 per 
cent failed at 18 ^ r < 19, 8 per cent at 19 ^ r < 20, 11 
per cent at 20 ^ r < 21, and 14 per cent at 21 ^ r < 22), 
implying an increase in failures at higher redshift; we may 
also have a problem if the majority of the failures lie at a 
particular part of the redshift distribution in a given mag- 
nitude range. We can place bounds on the effect of such a 
systematic as follows: we compute the change in (S^^) that 
results from assuming that all the failed redshifts were at 
and at 00 (which yields the 2^2:; asymptotic value of 
E^^). We can compute the fractional error 

'-^^ I \ /measured/ 

where /failed is the fraction of redshift failures (0.11), 
{^c^} failed is the average value of E~^ for those galaxies 
that had failed redshift determination, and measured is 

the average value of EJ^ for those galaxies that were used to 
compute the signal using the observed redshift distribution. 
If we assume all failed redshifts were 0, then (E~^) failed ~ 0, 
and therefore we expect a bias of -11 per cent (or —14 per 
cent for r > 21 and —9 per cent for r < 21), where our 
use of the redshift distribution from the redshift determina- 
tion successes overestimated E^^ (and therefore underesti- 
mated the signal) by that amount. If we assume all failed 
redshifts are S> zi , then the effect depends on the lens lumi- 
nosity (where it is less important for lower luminosity and 
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Table 4. The estimated overestimation of the lensing signal if 
all redshift failures are at very high redshift, as a function of lens 
bin and source sample. 



Lens sample (5(AE)/AE, per cent 





■r < 21 


r > 21 


LI 


0.1 


0.4 


L2 


0.2 


0.6 


L3 


0.5 


0.9 


L4 


0.9 


1.4 


L5 


1.6 


1.9 


L6 


2.7 


2.5 



redshift lenses, for which all sources were essentially at oo 
anyway) and the source sample. The degree to which the 
signal was overestimated in this case is given, for each lu- 
minosity bin and source sample, in tabled] As shown, the 
effect is at most 2.7 per cent overestimation for L6 with 
r < 21 sources, much less for the faint bins. The reality is 
somewhere in between these two extremes, most likely to- 
wards the higher end of the range quoted since we expect 
more redshift failures at higher redshift, but cannot easily 
be estimated, so we use these values to define the 95 per cent 
confidence interval. (This estimate did not take into account 
the effect of changes in EJ^ on the weighting, since the use 
of non-optimal weighting should increase the errors without 
inducing a bias.) 

We compared the magnitude distribution among the 
matches to that in our source catalog. If they are not com- 
parable, redshift distributions determined using the DEEP2 
sample may not be applicable to our source sample. Ta- 
ble |S] shows a comparison of the samples. The first column 
shows the fractions of galaxies in magnitude bins for our 
source catalog overall, and the second column shows the 
fractions in the Groth strip (with 95 per cent confidence 
intervals from the binomial distribution for the latter due 
to its small size). Since they agree within the errorbars, we 
need not worry about the Groth strip being very different 
from the full source catalog. The third column shows the 
fractions in magnitude bins for the full SDSS photometric 
sample (i.e. no lensing-related cuts), Groth strip only, re- 
stricted to the magnitude range for which the CFHT detec- 
tor does not saturate. The fourth column shows the fractions 
of galaxies actually targeted as a function of magnitude (for 
r < 21.8); this column is statistically consistent with the 
third column, implying that the targeting is indeed inde- 
pendent of the r magnitude. Finally, the fifth column shows 
the fractions of galaxies as a function of magnitude for the 
matches between the DEEP2 redshift catalog and our lens- 
ing catalog with successful redshift determination, which is 
consistent with column two, the fraction of lensing-selected 
galaxies as a function of magnitude in the Groth strip be- 
fore matching against DEEP2, with the exception that the 
DEEP2 galaxies are by necessity missing the r < 18 sample, 
which is roughly 2 per cent of our catalog. Furthermore, the 
fraction of galaxies categorized as high-redshift LRGs was 
similar in the two samples (9 per cent in our background 
catalog, versus 11.5^3 5 per cent in the DEEP2 matches at 
the 95 per cent confidence level), implying that there is no 
color-dependence of the targeting and redshift failures with 
regards to this particular subsample. 



While using the spectroscopic redshift distributions 
may seem to be the ideal way of determining Ec, there are 
two caveats that make this solution less promising. First, the 
use of average redshift distributions without any use of pho- 
tometric redshift errors means that many more physically- 
associated lens-source pairs are included in the calculation. 
While the resulting dilution of signal may be corrected for 
using boost factors, boosting is another potential source of 
systematic error since we cannot correct for intrinsic shear, 
and it is desirable that the boost factors be as low as possi- 
ble. Second, because these distributions were determined us- 
ing a small portion of the sky, they may be unduly influenced 
by large-scale structure, and not representative of the red- 
shift distribution in the survey as a whole (in Uti.ll we show 
how the effects of large-scale structure increase the errorbars 
determined purely using statistics) . The ideal solution to the 
second problem would be to have spectroscopic redshifts de- 
termined for a random sample of galaxies selected from the 
entire SDSS survey area, which could be used to determine 
source redshift distributions or photometric redshift error 
distributions. 

Consequently, we also used these DEEP2 redshifts to 
determine photometric redshift error distributions, enabling 
us to use photometric redshift information to eliminate 
physically-associated pairs, yet to also avoid any bias due 
to errors in the photometric redshifts. The determination of 
these error distributions was done by dividing the r < 21 
matches into bins based on photometric redshift (in order 
to determine error distributions as a function of photometric 
redshift). Our computation of E~^ then takes into account 
the photometric redshift error distribution: 

T.~^{zi,Zp) ^ J p{zs\zp)'E~^{zi,Zs)dzs (25) 

The signal is then computed as usual, but instead of assum- 
ing that Zs — Zp (i.e., p{zs\zp) = S{za — Zp)), we use the value 
computed by evaluating the integral in equation 1251 

Results from our work with the DEEP2 data are shown 

in 

3.4 Reference distribution: LRGs 

There is one subset of SDSS source galaxies for which ex- 
cellent redsh ift information is know n: Luminous Red Galax- 
ies (LRGs) jEisenstein et al]l200l[) . These galaxies have a 
well-known color-redshift relation that allows photometric 
redshifts to be determined with excellent precision. 

For the LRGs selected according to criteria shown be- 
low, the photometric redshifts perform significantly bet- 
ter than for the general source sam ple. Furthermore, as 
shown in iPadmanabhan et al.l l2004l . redshift distributions 
for LRGs from the 2dF-Sloan LRG and Quasar Survey 
(2SLAQ) can be used to construct very reliable redshift 
distributions for these galaxies. Note that for LRGs, the 
re dshifts used were from a sim ple template code discussed 
in IPadmanabhan et all i2004) rather than from kphotoz; 
however, for most regions of color space occupied by LRGs, 
the results are highly correlated. 

The selection criteria used for the LRGs are as follows: 

(i) d± = {r~i)~ 0.125(g - r) > 0.45 

(ii) C|| = 0.7{g - r) + 1.2(r - i - 0.18) > 1.6 
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Table 5. The fraction of source galaxies in each magnitude bin for our source catalog overall and in the Groth strip, all SDSS photometric 
galaxies in the Groth strip with 18 < r < 21.8, all DEEP2 galaxies targeted with r < 21.8, and the matches between DEEP2 and our 
source catalog. (95 per cent confidence intervals on all columns are from the binomial distribution; due to the constraint relating the 
sum of the numbers within each column, the errorbars arc anti-correlated.) 



Magnitude 


Lensing 


Lensing catalog. 


Photometric survey. 


DEEP2 


DEEP2 


range 


catalog 


Groth strip 


Groth strip 


targets 


matches 


r ^ 18 


0.022 


020+" °^'^ 


0.000 


0.000 


0.000 


18 < r <: 19 

19 < r ^ 20 

20 < r < 21 
21 < r < 21.8 


0.055 
0.153 
0.358 
0.411 


066+° °25 

U.UDD_Q 
n 1 97+0.033 
'J-^^' -0.028 
n •37S+0-044 
U-3'8_o.043 

Ain+°-°'^^^ 

U.'HU_(j 


n35+° °i2 
096+" °^® 
304+° °^'' 

U.OU'±_g Q2g 
U.ODO_Q g2g 


027+'''°^^ 
'-'•"^'-0.012 

U.U»D_g Q23 

316+° °'*^ 

U.01U_Q ggg 

n ^^8+0-042 


U.UOy:_g Q23 
r, iqo + 0.046 
U.100_g ggg 

U.OyO_g ggg 

Q -,7+0.060 
'-'■^^'-0.059 



(iii) g - r < 2.5 

(iv) r — i < 1.5 

(v) 0.4 <zp< 0.65 

(vi) r > 19 

These criteria were chosen as the combination of those 
from Eiscnstcin ot al. (2001) and from Padmanab han et alJ 
i2004l) that best suited the requirements of this work, 
namely very low levels of contamination from non-LRGs, 
from low-redshift LRGs, and from stars, yet sufficient num- 
bers that the LRG sample can be used as sources for weak 
lensing with reasonable signal to noise. 

Imposing these criteria on our source catalog yields 
2 884 242 LRGs total, though only about 65 per cent are 
in regions with lenses. A plot of the LRG redshift distribu - 
tion derived using 2SLAQ work jPadmanabhan et al ] |2004D 
is in Fig.|S| for this distribution, only the 72 per cent of the 
LRG sample with photometric redshift greater than 0.45 
were used because the inversion method gave unreliable re- 
sults for 0.4 < Zp < 0.45. As shown, the redshift distribu- 
tion peaks around z ~ 0.5 — 0.55, which illustrates another 
advantage of using LRGs as sources: when we use SDSS 
main spectroscopic sample galaxies as lenses, with redshifts 
mostly below z ~ 0.25, does not vary strongly with 

source redshift for sources at such high redshift, so an error 
in these distributions will make a very small difference in 
the results. 

Note that approximately 57 per cent of the LRGs 
in the catalog are f a inter than i model magnitude of 20. 
IPadmanabhan et al.l (|223) only includes redshift distribu- 
tions down to this limit, so our use of that inversion method 
for this sample is suspect. However, tests of photometric red- 
shifts using the DEEP2 matches indicate that that the mean 
bias is the same for the i > 20 LRGs as for the i < 20 LRGs 
(0.02), and the rms scatter is larger but not excessively so 
(0.07 vs. .05). which suggests that th e use of the inversion 
method in IPadmanabhan et alJ (|200^ with the same error 
distributions for i < 20 and i > 20 is justified here. 

In addition to computing the signal with the full sam- 
ple using the photometric redshifts directly, and with the 
restricted sample using the redshift distribution, we also 
computed it with photometric redshifts on two subsamples, 
those with > 0.5 and dx > 0.55. Because these stricter 
cuts would help avoid contamination by lower-redshift galax- 
ies being scattered up to higher redshift, we try imposing 
them on our sample; if the signal does not differ when we do 



LRG redshift distribution 




Q - 
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Figure 6. The redshift distribution for the high- 
reds hift LRG sample derive d via the inversion method 
from lPadmanabhan et all l2004h . 



this, then we know that using the full sample is relatively 
safe. 

Fig. |5| shows the r model magnitude distribution of 
sources classified as high-redshift LRGs, and Fig. 21 shows 
their R2 distribution, in comparison to the values for the 
overall source sample. As shown, the LRG sample is, on av- 
erage, at lower R2 and fainter magnitudes than the other 
source samples, and thus we may be concerned that it will 
have more significant shear calibration bias, as evidenced by 
the confidence intervals in Table El 

In principle, given the excellent redshift information for 
the high-redshift LRGs, one solution to our lack of reliable 
redshifts for all sources would be to do g-g lensing entirely 
with LRGs as sources. However, we then are faced with a 
problem of statistics, since LRGs are such a small fraction of 
the sources available (9 per cent), yielding large statistical 
errors on AE. This problem is the reason why our test for 
calibration bias is so useful; it allows us to use the excellent 
redshift information from LRGs to check the calibration of 
the lower-redshift source samples, which contain many more 
galaxies, enough to obtain excellent statistical errorbars. 
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4 OTHER SYSTEMATICS ISSUES 

Here we consider a number of remaining systematics issues 
that arise when computing the lensing signal. Some are cal- 
ibration uncertainties that scale with the signal, similar to 
the shear and redshift distribution calibration uncertainties 
discussed previously; others are systematics that do not scale 
with the signal, and have amplitudes that depend on the an- 
gular or physical lens-source separation. 

4.1 Random points test 

The random points test requires computing signal 
AErand('') using random lens catalogs (i.e., sets of random 
positions generated with the angular mask of the spectro- 
scopic survey area, using the same lens redshift and mag- 
nitude distributions as the real lens sample). In practice, 
these distributions are preserved by drawing the redshifts 
and magnitudes from the real sample, without replacement. 
The random points test is useful because a nonzero signal re- 
veals the presence of spurious shear power (systematic shear) 
in the source catalog which would lead to an additive bias in 
the lensing signal. The rando m catalogs used for this wor k 
were generated using mangle jHamilton fc Tegmarkll2004) . 

In the absence of systematic shear, we expect the ran- 
dom points test to show zero signal around random points. 
However, there is a slight smearing of images in the scan di- 
rection because the charge transfer is not continuous, instead 
occurring in very quick transfers from pixel to pixel, so that 
the PSF is convolved with a rectangle one pixel (0.4") wide 
along the scan direction only. The result is that the PSF 
is not circularly symmetric. This effect may be exacerbated 
if the scan rate and charge transfer rate are not perfectly 
matched, leading to convolution on a scale larger than one 
pixel.^ While in principle PSF-correction algorithms should 
correct for PSF asymmetry, in practice this is difficult to 
do perfectly. In addition, since the charge-transfer efficiency 
may also depend on the amount of charge, the PSF asymme- 
try may be different for faint objects than for bright ones, 
but since Photo fits the PSF using stars around r ~ 19, 
code that uses the Photo PSF may not be able to fully 
correct for this effect. We found that the re-Gaussianization 
scheme over-compensates for the effect. 

We address the presence of this spurious signal by de- 
termining it to as high precision as possible using A'^ random 
catalogs (where the number of random catalogs is limited by 
processor time), then subtracting it from the observed sig- 
nal. This procedure results in the errorbars on the signal 
rising by a factor of ^{N + 1)/N 1 + 1/{2N). Our choice 
of A'^ = 24 means that the errorbars only increase by 2.1 per 
cent. Results for the random catalog test are shown in 

However, random catalog signal subtraction, while 
widely accepted in the literature, ignores the possibility that 
fluctuations in the number density and systematic shear may 
be correlated. We describe this problem as follows: A g-g 
weak lensing measurement entails computing the correlation 
(717-1-). The measured number density of lenses n can be de- 
composed into n = n + 5n, an average density of sources on 
the sky plus fluctuations. The measured total shear 7+ can 

^ We thank James Gunn for pointing out this effect. 



be decomposed into 7+ = 7+ +7si/s, the true shear field plus 
systematic shear. When we compute the signal around ran- 
dom points, we obtain the quantity (n7+). Random catalog 
subtraction thus corresponds to measuring 

(717+) - {nj+) = {Snj+) = (5n7+) 4- {Snjsys). (26) 

The first term on the right side, (^717+), is the correlation 
that we hope to measure, but the second term is an addi- 
tive systematic error that has not been discussed in previous 
works on g-g weak lensing. We cannot assume that Sn and 
7sys are completely uncorrelated, because both quantities 
are slightly correlated in some way with the PSF. There are 
several ways in which such an effect could become signifi- 
cant. For example, since there is a gap between camcols on 
the SDSS camera, the same region must be rescanned with 
some offset to fill in that gap on a different night, which 
may have very different seeing and other conditions, such 
that 7sys fluctuates on small scales. If the number density of 
lenses also fluctuates on the same scale, then we could have 
some nonzero contribution from the (Sn^aya) term. For this 
work, we assume that this term is negligible, since the corre- 
lation should be small and has never been detected before, 
but this issue should be addressed more fully in future work 
to ensure that this assumption is reasonable. 

4.2 45-degree test 

Another useful test of systematics in the lensing signal is the 
45-degree test, which requires computing the lensing signal 
with the coordinate system rotated by 45 degrees. By in- 
version symmetry, the 45-degree rotated signal 745 should 
vanish, with weak lensing only contributing to the tangen- 
tial shear, 7+. The presence of such a signal could indicate a 
variety of shear systematic errors, since they generally con- 
tribute both to 7+ and 745. We computed the 45-degree 
rotated signal for all three source samples, so that together 
these tests were done for all sources in the catalog (we as- 
sume that the choice of redshift distribution does not matter 
for this test, as 745 would be nonzero due to shear system- 
atics, so we only use one choice of redshift determination 
method for each of those samples) . Results for this test are 
in 

4.3 Star-galaxy separation 

Star-galaxy separation is an important issue for weak lens- 
ing, since the inclusion of stars in the source sample would 
dilute the signal. Thus, a balance must be struck, ensur- 
ing the purity of the galaxy sample used as sources, while 
avoiding being overly conservative and eliminating too many 
galaxies in the course of doing this separation (which would 
lead to poor statistics). Star-galaxy separation for this cat- 
alog was accomplished via two cuts: first, the requirement 
that the Photo flag OBJC.TYPE be equal to 3, or galaxy; 
and second, the requirement that R2 > 1/3 (i.e., the object 
must be 50 per cent larger than the PSF). T he type deter- 
mination for DRl and DR2 is described in iLupton et alJ 
(2001); in brief, it utilizes the linear combination of fits 
of galaxy profiles to two models (exponential and de Vau- 
couleurs), then allows the ratio of the fiux in a fit to a 
PSF shape to that in the linear combination of galaxy 
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models to determine the type, via the requirement that 
mpsF — rricmodei 0.24 for galaxies. As shown in figure 
3 in that paper, this procedure ensures a relatively pure 
galaxy sample even close to r = 22, with stellar contamina- 
tion fraction (determined using Hubble Space Telescope, or 
HST data in the Groth strip) being negligible at r brighter 
than 21, and 7 per cent for 21 < r < 22. It is also clear 
from that figure that OBJC_TYPE is much more likely to 
default to calling galaxies to stars rather than vice versa, 
with galaxy contamination of ~ 33 per cent in a sample of 
"stars" for 21 < r < 22. While there are probabilistic meth- 
ods that are more accurate at the faint end (r > 21), such 
as tha t used in ^cranton ct al. ( 20o3) and ISheldon et alJ 
(|2003), OBJC.TYPE's conservative tendency should not 
cause significant stellar contamination, though it does re- 
duce the density of sources in poor seeing at faint magni- 
tudes. 

In order to check that our shape measurement cuts re- 
duce stellar contamination due to failures in OBJC_TYPE, 
we used publicly-available catalogs from the Great Obser- 
vator ies Origins Deep Survey (GOODS), carried o ut via the 
HST jGiavalisco et al.l2 005'. Gia vahsco et al.l2004l) . Charac- 
terizations of stars versus galaxies are much more accurate 
to fainter magnitudes in space-based surveys such as this 
because the PSF is much smaller. We found 577 objects at 
r < 21.8 (of which 60% are at r > 21) that were matches 
between the full SDSS photometric catalog and the GOODS 
north field, centred at Dec. -1-62° 15' and RA 12''37™.7. We 
found that the "galaxy" classification is incorrect about 1.5 
per cent of the time for r < 21 and 7 per cent of the time 
for 21 < r < 21.8 objects, and the "star" classification is 
incorrect a larger fraction of the time (3 per cent at r < 21 
and 40 per cent at 21 < r < 21.8), confirming the results 
from iLupton et all J200j) that OBJC.TYPE tends to de- 
fault to calling small, faint objects stars. 

However, when we restrict to the subset of 146 objects 
that passed all shape-measurement and other criteria to be 
included in our source catalog, of which 73 are at r < 21 
and 73 at r > 21, we find that only 2 (1.4 per cent) of those 
included are stars, or per cent contamination in the r < 21 
sample and 2.7 per cent contamination in the r > 21 sam- 
ple. We see that the resolution-factor and other cuts reduce 
stellar contamination by a factor of three from the result us- 
ing OBJC_TYPE alone. Using the binomial distribution to 
get 95 per cent confidence intervals on out stellar contami- 
nation estimates yields [0, 0.040] (r < 21) and [0.003, 0.096] 
(r > 21). Technically, we should take into account that the 
GOODS north field is at 1/ sin b ~ 1.2, but the average value 
for the full lens sample is {l/sin6) ~ 1.40, so we might ex- 
pect slightly higher stellar contamination in the full catalog 
than that computed for the GOODS field. However, because 
the dependence of stellar contamination on 1/ sin b is diffi- 
cult to model, we do not attempt any correction. 

To check the signal for contamination of our source cat- 
alog by stars, we computed the signal at low versus at high 
galactic latitude (cutting at sin 6 < 0.7) to compare the re- 
sults. Note that while finding a lower signal at low galactic 
latitude may indicate a problem with star/galaxy separa- 
tion, it may also indicate the presence of other systematics. 
In particular, since the extinction is greater at low galac- 
tic latitudes, galaxies near the faint end at a given mag- 
nitude were actually lower signal-to-noise measurements at 



low galactic latitude than they were for high galactic lati- 
tude, so in principle there could be a shear systematic caus- 
ing a difference between these two samples as well. The re- 
sults of this test will be shown in H6.3I 

4.4 Seeing dependence of calibration 

Because our ability to correct the galaxy image for efi^ects 
due to the PSF depends on the relative size of the galaxy 
and the PSF, we must consider the possibility of a seeing- 
dependent shear systematic. Consequently, for each galaxy, 
we consider the size of the PSF used, and split our sample 
into "good seeing" (PSF size less than the median value, 
1.25 pixels in the r band) and "bad seeing" (PSF size greater 
than the median value). The signal was then computed using 
these two source samples, and compared. Results for this test 
will be shown in H6.3I 

4.5 i?2 dependence of calibration 

Because some calibration biases may be more prominent 
at lower R2, it is important to check for i?2-dependence of 
the calibration. There are several effects that could lead to 
apparent R2 dependence of the calibration: noise rectifica- 
tion bias, selection biases, and biases due to PSF-correction, 
which would be particularly important for less-well resolved 
galaxies. We computed signal using sources with R2 greater 
than and less than 0.55 in each band to check for bias; results 
of this test are presented in H6.3I 

4.6 Systematic differences between bands 

Like many other studies, we used shape measurements av- 
eraged over two bands, the r and i bands. While the shape 
measurements between the two bands may legitimately dif- 
fer for individual objects, due to (for example) spectral dif- 
ferences in emissions from the disk versus from the bulge 
of spiral galaxies, we also checked to ensure that the signal 
computed with the shape measurements from each band in- 
dividually gives the same AE. The results of this test will 
be shown in t|6.3l 

4.7 Boosts 

As discussed in H04, the lensing signal at small transverse 
separations is diluted by the inclusion of sources that are 
physically associated with the lens (i.e., are in the same 
group or cluster), and therefore are not really lensed. To 
correct for this effect, the signal for a given luminosity bin 
is boosted according to the weighted number of galaxies per 
unit area relative to the number from random catalogs. The 
signal is multiplied by a factor 



where n{r) is the weighted number of galaxies per unit area 
when the signal is computed, and nrand{r) is the same com- 
puted with random lens catalogs. The number from random 
catalogs takes into account the decrease in the number per 
area with radius due to survey edge effects, so the boost 



© 0000 RAS, MNRAS 000, 000-000 



16 Mandelbaum et al. 



can accurately account for the dilution of signal by physi- 
cally associated pairs. Consequently, B{r) — 1 — S^ia is the 
lens-source correlation function. 

The boosts add two sources of statistical error and two 
sources of systematic error. In the rest of this section we 
consider each of these. 



4.7-1 Statistical errors 

The statistical error arises because the boost factor is the 
ratio of two noisy quantities, n(r) and nrand(r). The noise 
in nrand{r) can be minimized by computing signal from a 
large number of random catalogs, where 24 are used for 
this work. However, for subsamples with a small number 
of lenses, nrand(r) is still slightly noisy at small separations 
due to the small size of the radial bins; this noise is taken 
into account in the bootstrap by multiplying the boost from 
each dataset and radial bin by a Gaussian random number 
of mean 1 and standard deviation equal to the fractional er- 
ror in nrand{r) as computed from the random catalogs. The 
noise in n(r) is taken into account naturally by the boot- 
strap, since each bootstrap resampled dataset will have a 
slightly different n{r) used for the boost. Hence, the statis- 
tical error due to the boosts is simple to take into account. 



4-7.2 Systematic error: non-uniformity of boost factor 

One potential systematic error arises because both AE and 
the boost factor vary strongly with luminosity at the bright 
end of the lens sample, so the luminosity bins that are 1 
magnitude wide may be too wide to properly compute the 
signal in the innermost radial bins, r < 50 /i~^kpc, where 
the boost is most important. By averaging over a large range 
in luminosity, with AE(r) and B{r) varying with luminos- 
ity, we may run into a situation where the product of two 
averages (AE(r) and B{r) separately averaged over lumi- 
nosity, then multiplied) differs significantly from what we 
really want, the average of products (i3(r)AE(r) averaged 
over luminosity). 

One rudimentary method of detecting the effects of us- 
ing wide luminosity bins on the boost factor is to split the 
brightest luminosity bin in half, compute the signal sepa- 
rately for each half (boosting each one individually), then 
average the signal from each half. The resulting signal can 
be compared against the signal computed using the full lu- 
minosity bin. Results for this test will be presented in tl6.3l 



4.7.3 Systematic error: magnification bias 

Another boost-related source of error is magnification bias, 
since the number of galaxies per area around real lenses may 
not be expected to be the same as that around the random 
points. There are three competing effects: first, that due to 
the magnification = 1 -I- 2fi; (in the weak lensing limit), 
where k = S/Ec, the number of lens-source pairs per unit 
area on the sky will decrease; second, that the magnifica- 
tion means that fainter sources will be visible than would 
have been otherwise, and therefore the number of lens-source 
pairs per unit area will increase; and third, the magnifica- 
tion changes the resolution factors of the source galaxies. 



The competition between the first two effects can be quan- 
tified by s = d\ogiQN{m)/dm, where N{m) is the total 
number density of source galaxies given a faint magnitude 
limit of m. For the r > 21 sample, we must taken into ac- 
count the loss of sources at the bright magnitude limit of 
21, and compute 

^ ^ dlog„)iV(m-,m+) _ dlog^^p N(m^,m+) 
dm+ dm_ 

where m+ is the faint magnitude limit and m_ is the bright 
magnitude limit. We compute s separately for each source 
sample: for the r < 21 source sample, we find s — 0.36; for 
21 < r < 21.8, we find s = 0.47 - 0.57 = -0.10 (i.e. when 
the magnitudes are shifted brighter, we lose more galaxies at 
the bright end than we gain at the faint end because of our 
cuts on the shape measurement); and for the LRG sample, 
we find s — 0.27. 

The resolution factor dependence of this effect has not 
been previously evaluated. If we take the Gaussian approx- 
imation for the galaxy, T'^' ^ T^^^ + T^^\ and note that 
T^-^^ oc /i in the weak lensing regime, we find that the effect 
of magnification is to adjust the resolution factor by 

5R2 = ~5[j^jjj^2{l-R2)K. (29) 

The number of galaxies that are gained due to the resolution 
factor cut is then 

8N 

— (res.) = 2(1 — -R2min)n(7i2min)K, (30) 

where n(i?2min) is the resolution factor distribution normal- 
ized to n{R2) dR2 ~ 1. The total change in the num- 
ber density o? galaxies is then 

— = [5s - 2 + 2(1 - i?2min)n(7?2min)] K. (31) 

For the three source samples, using the values of s given 
above and n(-R2,min) from H2.2.2I we estimate SN/N = 1.9k, 
0.7k, and 3.1k for r < 21, r > 21, and LRG samples, re- 
spectively. (Without taking into account the effect of the 
change in R2, we would have had SN/N = —0.2k, —2.5k, 
and —0.6k, so this effect significantly changes our sensitivity 
to magnification bias.) 

The convergence can be simply estimated for roughly 
power-law profiles 74 oc as 

K(r) = ^1 _ ij 7t(r-) -f k{oo), (32) 

where k(cxd) represents the mass-sheet degeneracy. We will 
ignore this since our boost factors approach unity at large 
separations. Since for our galaxies we find a ~ 0.85, it then 
follows that K « 1.474. We can then use computed values 
of AE and {E~^) to estimate K(r). Table|S|shows the best- 
fit power-law K(r) and our resulting predictions for SN/N 
for L3-L6; LI and L2 are not used because the shear is 
statistically consistent with zero in these bins, and therefore 
so is K. 

This effect is not taken into account explicitly, and 
therefore may lead to a systematic bias in the signal, since 
we assume that the increase in sources around lenses is due 
to physically associated pairs. For all samples, we may be 
overestimating the boosts and therefore AE (particularly 
for LRGs). We do not, however, attempt to correct for this 
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Table 6. Parameters of powor-law fits K(r) = ko{^'/1'i ""^Mpc) ° 
to the lensing signal. The resulting value of 5N/N as a function 



of lens and 


source 


sample 


is shown as well. 




Lens 


10"* KQ 


a 


SN/N 


5N/N 


sample 






(30 /i-ikpc) 


(100 /i~ikpc) 






r < 21 


without LRGs 




L3 


5.5 


0.68 


0.011 


0.005 


L4 


4.7 


0.88 


0.020 


0.007 


L5 


7.9 


1.00 


0.050 


0.015 


L6 


29 


0.95 


0.15 


0.049 


r > 21 


L3 


9.7 


0.47 


0.004 


0.002 


L4 


5.3 


0.81 


0.006 


0.002 


L5 


9.3 


0.92 


0.016 


0.005 


L6 


40 


0.85 


0.055 


0.020 


LRGs 


L3 


5.4 


0.70 


0.019 


0.008 


L4 


4.4 


0.93 


0.036 


0.012 


L5 


11 


0.95 


0.095 


0.030 


L6 


44 


0.89 


0.31 


0.11 



attempt explicitly, since until a concrete detection is made, 
these estimates should not be treated as certain. For the 2a 
confidence intervals, we use the estimates in table |S| as the 
expected value, with an expected uncertainty equal to half 
the magnitude of the estimate. 

4.8 Intrinsic Alignments 

Satellite galaxies that are physically associated with a lens 
will not be lensed; however, it is often difficult to remove 
them from the source sample in the absence of good red- 
shift information, which is why we must use the boost fac- 
tors described in the previous section. If the satellites are 
actually aligned with the lens in some way (radially or tan- 
gentially elongated) , this alignment will cause a f alse lensing 
signal. This effect is well-m otivated theoretically iLee fc PenI 
l200ll:lHui fc Zhandl2002h . and there are several claimed de- 
tections in the literature of correlations between the galaxy 
density (or specific features thereof, such as the Supergalac- 
tic P l ane) and intrinsic ellipticities (e.g. | Fhn fc Godlowski 
Il989l: iLee fc Penll2002l: iNavarro et alJl2004^ ■ We use AE.„f 
to represent the signal due to intrinsic shear, which is the 
shear signal that would be computed using our shear estima- 
tor with a population of physically associated galaxies. Note 
that the galaxy density-shear correlation that concerns us is 
distinct from the intrinsic shear-shear corr elations that add 
spurious power to cosmic shear surveys (Croft & Metzler' 
[2000 ; Heavens et al, 20QQ.; Lee fe^Pcn 2000; Catolan et al. 
I2061I: (Crittenden et alji200li: rjinall2o'o2h . 

In order to place upper limits on contamination due to 
intrinsic shear, we must determine what fraction fc of the 
lens-source pairs are physically associated, and the typical 
intrinsic tangential shear A^i„t of these sources. To compute 
fc, we use our results for B{r). Limits on A7i„t (or on the 
intrinsic ell i pticit y or position angle) have been obtained by 
iLee fc PenI 1200 ll) . lBernstein fc Norberd J2002h and H04; we 



use the results from H04 as these are the tightest constraints 
on relevant scales. Results for this estimation will be shown 

in a?nin 

4.9 Correction for non-volume limited lens sample 

In this section, we describe the corrections that account for 
the fact that the lens samples are not volume limited, when 
comparing between different source subsamples. These cor- 
rections required us to compute the mean weighted luminos- 
ity for each lens-source sample combination. This computa- 
tion was done as a function of transverse separation in order 
to check for any systematic effects. 

We found that there is a slight variation in the 
mean weighted luminosity with transverse separation, be- 
cause brighter lenses are more clustered, so in the regions 
where physically-associated pairs are abundant (i.e., where 
(,is(r) ^ 0), the mean weighted luminosity of the sample 
was higher than at larger radii where €,is{r) ~ 0. We see 
the same trend in the mean weighted redshift of the sam- 
ple, as well. However, this effect does not really mean that 
the mean weighted redshift or luminosity corresponding to 
the computed signal is larger at small radii, since the excess 
pairs around the brighter lenses that cause this effect do not 
contribute to the signal. 

We also expect that Lcsix) and Zaa(r) may increase at 
large r, because for more nearby (less luminous) lenses, the 
same transverse separation corresponds to a larger angular 
scale, and therefore survey edge effects may cause nearby 
lenses to lose pairs faster than distant lenses. The observa- 
tion of such an effect would indicate a problematic selection 
effect, but fortunately we do not see it at the maximum 
transverse separations studied in this paper (2 /i~^Mpc). 

The correction when comparing AE at slightly differ- 
ent Left is as follows: we use the results of the signal aver- 
aged over radius in each luminosity bin to derive a relation 
AE(L). Then, when we compare signals at two slightly dif- 
ferent luminosities (typical differences are less than 1 per 
cent) Left and LcS + SL, we assume that rather than being 
1, the ratio of the two signals should be 

AE(Left + 5L) ^ {dAJ:/dL)L„,,5L 

AE(Left) AE(Left) • ^ ' 

If this correction is non-trivial compared to the statistical 
error, then we must apply it; otherwise, we do not. 



5 APPLICATION OF THE SYSTEMATIC TEST 

In this section, we describe our test to determine the effects 
of the redshift distributions and systematic errors described 
in ^ and ^ and show calculations justifying our assertion 
that cosmology plays a negligible role. 

5.1 Methodology 

First, in order to compute bootstrap errors on the results, 
we divide our lens catalog into 200 bootstrap subsamples as 
described in H2.4.3I For each lens luminosity bin and source 
sample, we compute the signal in each of the bootstrap sub- 
samples, using 46 logarithmically spaced radial bins from 



© 0000 RAS, MNRAS 000, 000-000 



18 Mandelbaum et al. 



20 to 2000 /i~^kpc in order to measure AS(r) more sensi- 
tively where it varies the most, at small transverse separa- 
tion. (Plots of signal shown in this paper will have many 
radial bins averaged at small r so they are easier to read.) 

Next, to account for physically associated pairs and sys- 
tematic shear, we used random lens catalogs to compute 
AErand(?') and rirandir). Twenty- four random catalogs were 
used, so that these functions were determined reasonably 
smoothly. 

After this procedure, we performed bootstrap resam- 
pling, generating 2500 datasets for each lens luminosity bin 
i and source sample a as follows (to avoid confusion, we 
use Roman letters to denote lens luminosity bins and Greek 
letters to denote source samples): 

(i) For each lens-source sample combination, we use the 
same set (in order to take into account correlations) of 
random numbers drawn from a uniform distribution to 
choose 200 bootstrap subsamples randomly, with replace- 
ment. The signal from these subsamples are then averaged 
to get signal for each of the lens-source sample combinations, 
AE(j"^)(r). 

(ii) We compute boost factors Bi^air) using the weighted 
number of pairs per unit area n(r) relative to the average 
weighted number from random catalogs nrand{r). The er- 
ror in n(r) is taken into account by the bootstrap proce- 
dure itself, and the error in nrand{r) is taken into account 
via multiplication by a Gaussian random number of mean 1 
and standard deviation equal to the fractional uncertainty 
in Urandir), wherc a different random number is used for 
each i, a, and radial bin. 

(iii) We use the measured random catalog signal, taking 
into account its uncertainty via multiplication by a Gaussian 
random number of mean 1 and standard deviation equal 
to the fractional uncertainty in the random catalog signal, 
where a different number is used for each i, a, and radial 
bin. This procedure yields AE^'^"'*''(r). 

(iv) Finally, we compute the signal for each dataset, 
AE.,4r) = B,,4r)(AE(™'="^)(r) - AE(™"^'(r)). The ran- 
dom catalog signal (not just the signal measured with the 
real lens catalog) must be multiplied by the boost factor 
because all galaxies, whether physically associated with the 
lenses or not, have systematic shear associated with them. 

Once the datasets have been generated using this procedure, 
we can then do the following: 

(i) By averaging over the 2500 datasets, we can compute 
the mean value of AEi,ci('') (and other quantities such as 
Bi^a{r)) and its standard deviation. 

(ii) Using the bootstrap errors from the previous step for 
weighting purposes, we can compute a value of AEi,^ = 
(AEi,„(r)(r/l/i"^Mpc)°-*'') for each bootstrap resampled 
dataset. The (r/lft^^Mpc)" *^ is included in order to in- 
crease signal to noise on the averaged value. Only bins with 
r > 30 kpc//i are included in this calculation. 

(iii) By averaging over the 2500 values of AEi d , we can 
get the mean value (AEi^a) and its covariance matrix, where 
its value for overlapping source samples will have very high 
correlations (the extreme cases of identical source samples 
but different methods of computing E^^ typically have cor- 
relation coefficient r — 0.99). This (AEi,^) will be used to 



compare the signal for different sets of sources, the same lens 
sample. 

(iv) For each luminosity bin i, we can compute ratios of 
the averaged signal with different sets of sources, 



ili.a.fl — -== — r 

(AEi,^) 



(34) 



and its error. The errors on the numerator and denominator, 
and their covariance, were used to compute confidence inter- 
vals on Ri,a,i3 using the full non-Gaussian error calculation, 
which is important for those cases in which either number 
may be consistent with zero. (For details of the calculation of 
confidence intervals on the ratio of two correlated variables, 
see Appendix]^) Due to the high correlations between some 
of the source samples, the ratio could often be computed to 
extremely high accuracy. 

(v) For many of the tests described below, the ratio test 
was also done for the signal averaged over luminosity bins 
as well. This comparison must be done with care; since the 
relative weights of the signal in the different luminosity bins 
varies from sample to sample (e.g., all LRGs contribute to 
each luminosity bin, but there are many more r < 21 galax- 
ies in the faint bins than in the bright ones due to the 
Zp > zi + 0.1 cut, so for r < 21 the fainter bins get more 
weight than the bright ones relative to the LRG sample) , we 
must be careful that when comparing samples a and /3 aver- 
aged over luminosity, we use the same weights for averaging 
each sample over luminosity. If the weights differ, then we 
could mistakenly be led to believe that there is a calibra- 
tion error. So, for each bootstrap dataset, we compute the 
average value of AE via 



AEaii,Q = ^ Wi AEi 



where 



■Wi = 



(35) 



(36) 



Here, AE,; is some approximation to the actual value of AEi 
(in practice, the value averaged over subsamples a and /?), 
and 

1 



(37) 

"* '^L+'^?,;3-2Cov(AE,,c,AE,,^)' 

The values of a used in this expression are from the co- 
variance matrix already obtained for (AEi^a). By averaging 
over bootstrap datasets, we can then get values (AEaii.a) 
that allow us to compute ratios 



'all, a, 3 — 



(AEall,Q:) 



(38) 



(AEaii./g) 

and obtain non-Gaussian confidence intervals as usual. 
5.2 Cosmology dependence 

In this section, we show why the systematics test is essen- 
tially cosmology-independent. We compute AE by comput- 
ing 7t (which is independent of cosmology, except via the 
weighting scheme since wis oc E^^, which should not change 
the results, only the errorbars) and E^gucss with our as- 
sumed cosmology. Now, we propose that our cosmological 
model is wrong, so the true value of AE can be computed 
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Table 7. The ratio fcos relating the true value of AS for two 
different cosmologies to the one measured using the assumed cos- 
mology, shown for several lens redshifts and source redshifts. 



2; 




fcos 


fcos 






(flat = 1) 


(open Urn = 0.3) 


0.02 


0.12 


0.980 


0.987 


0.02 


0.70 


0.981 


0.988 


0.10 


0.20 


0.910 


0.941 


0.10 


0.70 


0.915 


0.946 


0.25 


0.35 


0.814 


0.878 


0.25 


0.70 


0.821 


0.885 



from the same 7^ that we computed, but with a different 
that we will call E^^^^,. Our results for a given lens 
and source sample are then related to the true AE by the 
relation 



AE 

measured = AEt rue 



(39) 



so we can define a "cosmology factor" fcoa = E~^^^(,/E^guogs 
relating the true and measured signal. Our concern, then, is 
that fcos for a given lens redshift will be different for each 
source sample, so that our comparison of AE computed with 
different source samples is invalid. 

We do a simple test for three lens redshifts {zi = 0.02, 
the minimum value; 0.1, the typical value; and 0.25, on the 
high-redshift tail of the lens redshift distribution), and two 
source redshifts {zs = z;-|-0.1 and 0.7) that span the range of 
source redshifts used. Furthermore, we test two cosmologies 
that are drastically different from the one assumed: a flat 
cosmology with £7^ = 1, and an open cosmology with f2m = 
0.3. For each cosmology, lens redshift, and source redshift, 
we can compute /cos relating our measured AE and the true 
value, and compare the values of fcos between the different 
source samples for a given lens redshift and cosmology. The 
results of this test are shown in table |7| 

We can see from Table|7|that while the measured value 
of AE may be off from the true one by a significant frac- 
tion, the differences between /cos for the same lens redshift 
but different source redshift are less than 1 per cent even 
for these extreme cosmologies for all cases. In reality, most 
of the lenses are near z ~ 0.1, most of the sources are at 
more intermediate values of redshift, and the allowed ranges 
of cosmology is much smaller than the extreme cases con- 
sidered here. Consequently, we can state with confidence 
that cosmology plays a negligible role in the comparisons 
of source samples for the allowable range of cosmology and 
lens redshift range probed by the SDSS. 

There were several over-simplications involved in this 
simple calculation. First, we did not take into account 
the lens or source redshift distributions; however, since we 
found that cosmology is not important for the range of 
lens redshifts used, and for the most extreme values of 
source redshift, this over-simplification is unimportant. Sec- 
ond, we did not account for the change in weighting when 
EJ'^ changes from the assumed to the true cosmology, but 
this change simply means that the weighting scheme was 
not optimal, which would lead to larger errorbars but no 
change in the result. Finally, we did not account for another 



change in AE due to the change in cosmology: since it is 
computed as a function of transverse comoving separation, 
r = QisD a(zi)(\ + zi) , cosmology also comes into the compu- 
tation of r for a given lens-source pair. The ratio r guess /Ttruc 
is nontrivially different from 1 for the two cosmologies con- 
sidered, and since AE cx r~°, rescaling r leads to AE being 
off by some factor. However, this effect is only a function of 
lens redshift, so the rescaling will be the same for all source 
samples, and therefore does not affect our use of the ratio 
test for systematics. 

Finally, it is worth noting that for futuristic surveys 
with lens redshifts around zi ~ 0.5 — 0.7 and two source 
samples at Zs ~ 1 and Zs ~ 1.5, the differences between fcos 
for the two source samples for the cosmologies considered 
here is more than 1 per cent, so if the systematics are better 
under control, then this sort of difference may be detectable 
with future datasets. 



6 RESULTS 

In this section, we present results for DEEP2 redshift distri- 
butions and photometric redshift error distributions; com- 
pare the different methods of error determination; and 
demonstrate the results of the systematics test. 



6.1 Redshift distributions and photometric 
redshift performance 

Here we describe the redshift distributions and photomet- 
ric redshift error distributions used for the rest of the paper. 
As mentioned in H3.3I several tests were performed using the 
DEEP2 redshifts. First, they were used to determine the red- 
shift distribution of the source galaxies, and then they were 
used to study the photometric redshift error distributions. 



6.1.1 Redshift distributions 

The redshift distribution determination using DEEP2 galax- 
ies was done by choosing a common functional form for the 
redshift distribution, the F distribution: 



p{z) 



dP 

Az 



z I z 



z?r(Q) 



(40) 



This probability distribution has mean {2) = azs, variance 
((2 — (2))^) ~ olzI, and mode {a — l)zs. A maximum- 
likelihood fit was done for parameters a and Zs for 18 < 
r < 21 and r > 21 galaxies separately, with fit results shown 
m table IHl The fit was performed by minimizing 



= ^[-21np(z,) 



(41) 



where the summation is over all DEEP2 matches in the ap- 
propriate magnitude range. This "x^" function can be min- 
imized to give the best-fit redshift distribution, but it does 
not have the same distribution as the usual function; its 
statistical properties are summarized in Appendix ^ For 
example, goodness-of-fit cannot be measured in an abso- 
lute sense directly from the x^i since the "zero-point" is 
undetermined; only differences between the fits for different 
models are significant. Note that since the majority of the 
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Table 8. Parameters of best-fit redshift distributions for several 
source samples. For both samples, r{zs,a) = —0.94, so the fit 
parameters are highly correlated. Results for errorbars and the 
variance of the are shown both with and without large-scale 
structure taken into account. 



Sample 


18 < r ^ 21 


r > 21 


Ns 


162 


116 


Zs 


0.100 


0.105 




0.012 


0.014 


o"LSs{2s) 


0.047 


0.017 


a 


3.52 


4.36 


C7{a) 


0.38 


0.55 


o-LSs(") 


1.77 


0.70 




0.35 


0.46 


a{z) 


0.19 


0.22 


Taodc(z) 


0.25 


0.35 




-117 


-43 


(X }theory 


-115 


-42 


Var(x2) 


283 


194 


VarLSs(x^) 


4917 


361 



galaxies are located in the peak of the probability distribu- 
tion, where p{z) > 1, the best-fit < 0. Table |5] summa- 
rizes the fit results, and includes errorbars both with and 
without large-scale structure (LSS) taken into account; LSS 
is taken into account as described in Appendix IX| assum- 
ing an gular correlation function w{9) from Conn olly et alJ 
l|2002). Because that w{6) may not be correct for lensing 
selected galaxies, this model may not perfectly capture the 
increase in errorbars due to LSS, but should compute them 
to within reasonable accuracy. For the r > 21 sample, we 
used amplitude of the ang ular correlation f unctio n for the 
21 < r < 22 sample from IConnollv" et al.l (|2003). For the 
r < 21 sample, our errorbars including LSS are a worst-case 
estimate, where we combined the amplitudes A\, A2, and 
for 18 < r < 19, 19 < r < 20, and 20 < r < 21 sources (with 
fractions of galaxies /i, /2, and /s) from that work to get an 
overall amplitude A = {fi^/Al + h^/^ + /sa/^)^, which 
assumes perfect correlation between the samples. While a 
perfect correlation is unrealistic, we use this result to place 
conservative errorbars with LSS taken into account. 

The resulting distributions for lensing-selected galaxies 
are shown in figure |3 along with the histogram of redshifts 
from DEEP2, and the COMBO-17 distribution for all galax- 
ies. While the fits are reasonably good, it is apparent that 
the r distribution may not be the best choice of distribu- 
tion because its shape is not well-suited to matching the 
shape of the histogram of redshifts from DEEP2 for r > 21. 
However, we have not found another functional form that 
would be more appropriate. Note that this plot shows the 
actual redshift distributions; when the signal is computed, 
the effective source redshift distribution, which involves the 
weights used for each lens-source pair, is more important. 
These weights tend to emphasize the higher-redshift por- 
tion of the curve via the inclusion of but that por- 
tion is also downweighted due to the fact that more of the 
higher-redshift sources are detected with lower significance, 
and therefore have higher cje- AH plots of redshift distribu- 
tions shown in this paper are the unweighted versions. As 
shown, the COMBO-17 distribution is at slightly higher red- 
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Figure 7. The redshift distribution from COMBO-17 and 
DEEP2, shown separately for 18 < r < 21 and for r > 21 galax- 
ies. The lines are, as labeled, the COMBO-17 p(2|r) averaged over 
p{r) for all galaxies, and the best fit F distribution from DEEP2 
for lensing-selected galaxies. The histograms are derived from the 
actual data. 

shift than the best-fit DEEP2 distributions; the discrepancy 
may be attributed to an offset in the magnitudes used to 
determine the distribution, and the fact that the galaxies 
involved in its determination were not required to pass our 
selection criteria. 

As compared to DEEP2 matches for the full photomet- 
ric sample, including those that had failed shape determi- 
nation and therefore were not in the source catalog, these 
redshift distributions are at slightly lower redshift on aver- 
age, as would be expected. 

We can estimate the possible variation of the signal due 
to statistical error in the redshift distribution determination. 
This calculation can be done using the statistical errorbars 
from the fits, and the larger ones that include LSS, for both 
r < 21 and r > 21 sources. We determine errorbars on 
via propagation of the fit covariance matrix C for the vector 
of parameters a = {zs,a) via 

For this computation, we use two-sided derivatives with re- 
spect to Zs and a evaluated at separations iO.lo" around the 
best-fit parameters Sfu- We can then compute the calibra- 
tion uncertainty on the signal, 5(AE)/AE = — 5(E^^)/E~^ 
for a given lens redshift. Results for this computation are 
shown in table|5] As shown, the uncertainty in the signal due 
to the redshift distribution statistical uncertainty is larger 
for higher zi since those lenses are closer to the source red- 
shift distribution; for lenses at zi = 0.03, the majority of the 
distribution is essentially at 2 = 00, so dEj^/da is small. 
Furthermore, the errors on the distribution at r < 21 have 
a greater effect than those on the distribution at r > 21 
because the brighter sources are, on average, closer to the 
lenses. As shown, the errors for the 18 < r < 21 sources 
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Table 9. Estimated calibration uncertainty on AS for two values 
of zi and p{z) computed for r < 21 and r > 21 sources. The 
uncertainties are shown both with and without LSS taken into 
account. 



2; 




i(AE) 
AE 

(18 < r < 21) 


i{AE) 
AE 
(r > 21) 


0.03 


without LSS 


±0.007 


±0.0055 


0.03 


with LSS 


±0.040 


±0.0073 


0.20 


without LSS 


±0.053 


±0.045 


0.20 


with LSS 


±0.25 


±0.057 



with LSS are quite large (large enough for zi — 0.2 that we 
may worry about the accuracy of this linear calculation), 
but that is due to the fact that we used a very conservative 
estimate of w{0) for this sample as described above. Conse- 
quently, those errorbars are very conservative, and in reality 
are most likely significantly smaller and closer to the error- 
bars without LSS. Nonetheless, we can see that using the 
redshift distribution for r < 21 may be a bad idea because 
errors in its determination may have a large effect on AE. 



Table 10. Conditional bias (S) and scatter ag as a function 
of r model magnitude for non-LRGs with successful photometric 
redshift determination 



Magnitude range 


(r> 




{S) 


0-6 




All 


Zp 






18 < r < 20 


19.3 


46 


0.035 


0.14 


20 ^ r < 20.5 


20.2 


39 


0.008 


0.13 


20.5 ^ r < 21 


20.8 


50 


0.084 


0.17 


21 < 22 


21.5 


100 


0.11 


0.25 


Zp 


> 0.12 only 






18 ^ r < 20 


19.2 


39 


0.032 


0.15 


20 ^ r < 20.5 


20.2 


32 


-0.022 


0.12 


20.5 < r < 21 


20.8 


44 


0.058 


0.16 



Table 11. Photometric redshift conditional bias (S) as a function 
of Zp for non-LRGs with successful photometric redshift determi- 
nation. Only r < 21 sources are included; conditional biases and 
scatter become worse for r > 21 sources, so their photometric 
redshifts were not used. 



6.1.2 Photometric redshift errors 

In order to be able to use photometric redshifts to eliminate 
physically associated pairs, we also computed photometric 
redshift error distributions using the matches from DEEP2. 
Of the 278 matches, 162 (58 per cent) were at r < 21 (with 
9 of those, or 6 per cent, having failed photometric redshift 
determination). Because high-redshift LRGs have different 
properties from the overall photometric sample, we explic- 
itly exclude them from this comparison, leaving 135 galaxies 
with which to study photometric redshifts. 

First, we define the terminology used in this section. 
For a set of true redshifts z and photometric redshifts Zp, 
we define the photometric redshift errors as S = z ~ Zp, 
and can construct average distributions p{S\z) and p{5\zp) 
in bins. The statement that the photometric redshifts are 
biased means that 

J p{S\z)5d5 ^0. (43) 

For our sample of 135 redshifts at r < 21, we find that overall 
there is a small photometric redshift bias, where the overall 
bias is 0.04, with (z) = 0.34 and {zp) = 0.30. When we look 
at the subsample of 64 galaxies below the mean redshift, the 
bias is zero to two significant figures and the scatter is 0.11. 
When we restrict to the subsample of 71 galaxies above the 
mean redshift, the mean bias is 0.08, and the scatter is 0.17. 

However, the calculation that is actually more relevant 
for the purpose of this paper is the "conditional bias," 

(S) = J p{5\zp)5dS (44) 

In particular, even if the bias is zero, then for a redshift 
distribution that peaks at redshift (z) = 0.34, we will have 
(S) > for Zp < (z) and (S) < for Zp > (z) simply due 
the scatter in the photometric redshifts. We also define a 
"conditional scatter," 



as = J p{S\zp){5-{S))^dS. (45) 



Zp range 








OS 


All 


0.30 


135 


0.045 


0.15 


Zp < 0.12 


0.04 


20 


0.15 


0.13 


Zp ^ 0.12 


0.34 


115 


0.027 


0.15 


0.12 ^ 2p < 0.25 


0.17 


28 


0.074 


0.13 


0.25 Zp < 0.35 


0.31 


28 


0.076 


0.21 


0.35 ^ Zp < 0.42 


0.38 


28 


0.017 


0.07 


Zp ^ 0.42 


0.49 


31 


-0.051 


0.11 



Table 1101 shows the mean conditional bias and scatter 
as a function of magnitude; table [TTI shows the same infor- 
mation as a function of photometric redshift. Information is 
included for r > 21 in Table ITI11 for informational purposes, 
but since we did not use photometric redshift information for 
those sources due to the large bias and scatter, they are not 
included in the calculations for Table fTTI A likely explana- 
tion for the increase in as at fainter magnitudes is the larger 
photometric errors which make photometric redshift deter- 
mination more difficult. The fact that our test of this pro- 
gram on the SDSS spectroscopic sample at r < 18, on galax- 
ies with negligible photometric errors, shows (5) = —0.003 
and as — 0.048, with as increasing with r magnitude if the 
sample is split into subsets, supports this hypothesis. Also, 
because we require zi > 0.02 and Za > zi + 0.1, we only 
use photometric redshifts greater than 0.12 in our analysis, 
though error information has been included in the tables for 
those less than 0.12. Our finding that photometric redshifts 
are, on average, biased low is in accordance with the same 
finding in H04 (based on the fractions of physically associ- 
ated pairs found using Zp < zi ~ 0.1). 

To illustrate the effects of photometric redshift errors. 
Fig. |H] shows the redshift distribution for the r < 21 sam- 
ple computed using photometric redshifts, with and with- 
out the errors taken into account according to Eq. 1251 and 
using the spectroscopic redshift distributions from DEEP2 
directly. As shown, the photometric redshift distribution has 
some strange features, including the peak near z — and 
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Figure 8. The redshift distribution for r < 21 non-LRG sources 
with successful photometric redshift determination, shown com- 
puted with spectroscopic redshifts and with photometric red- 
shifts, with and without error distributions taken into account. 

at z = 0.38; the distribution with the errors taken into ac- 
count looks significantly more reasonable, and is an excellent 
match for the histogram of redshifts from DEEP2. In fact, 
the distribution with photometric redshift errors taken into 
account appears to be a better match than the best-fit F- 
distribution. (Any discrepancies between the distributions 
m Figs. □ and H arise from the fact that figure |7| includes 
LRGs in the r < 21 sample but Fig. |H1 does not.) While 
we may have been concerned about using so few galaxies 
to determine the error distribution as a function of photo- 
metric redshift (by dividing 135 galaxies into five bins), it 
is apparent that even with so few galaxies, enough of the 
error distribution can be determined that this procedure is 
reasonably successful. 

While we have determined the photometric redshift er- 
ror distributions, we are still far from an understanding of 
how they affect the lensing signal. There are two effects that 
must be included: 

• The conditional bias, {5), must be included in our com- 
putation of E^^. Because (5) > for low Zp, the corrected 
redshift is larger than Zp, raising E^^ (lowering AE) once we 
make this correction. This effect is particularly important at 
low values of Zp, because E~^ is varying more rapidly with 
source redshift at low redshift. (However, since EJ^ is lower 
at low source redshifts, these sources get less weight, so the 
error they cause is not as high as one might naively expect.) 
At high redshift, (5) < 0, so the correction will lower the red- 
shift and E~^ (raise AE), but because E~^ varies so slowly 
with redshift in that regime, we expect that this will only 
be a minor correction. The majority of the photometric red- 
shifts used are around 0.35-0.4, where the conditional bias 
goes to zero, so we may expect that its effects will not be 
too large. 

• The conditional scatter as must also be included. At 
photometric redshift more than as above the lens redshift 
(i.e., for most sources in this work, due to our requirement 



that Zp > zi + 0.1), because EJ^ is an increasing function 
of source redshift with negative second derivative in this 
regime, the inclusion of the width of the error distribution 
will lower the estimate of E^^ (raise AE). This effect is 
counter to the effect of the conditional bias {5) at low Zp, 
and in the same direction as it at high Zp. However, at low 
Zp, close to zi, the effect of the 5- function in the second 
derivative at Zs = zi is that E^^ is actually higher than 
it would have been without the scatter, i.e. this effect goes 
in the same direction as the bias at very low photometric 
redshift. 

Figure |5] shows a plot of E~^ for lenses at several red- 
shifts, as a function of source redshift, for several different 
possibilities: straight acceptance of source photometric red- 
shift, use of the photometric redshift corrected for the con- 
ditional bias, and corrected for the full photometric redshift 
error distributions (actual, and best-fit). The fit for the dis- 
tribution was done by assuming it is a Gaussian with a fixed 
width but with a mean that is a linear function of Zp. This 
form was assumed because it is a simple parametrization 
that can still account for the fact that the error distribution 
changes sign as we go from low to high Zp. The effects of the 
mean correction (which raises E^^ for Zp < 0.4 and lowers 
it for Zp > 0.4) and the width of the error distribution can 
clearly be seen in this plot. 

Vertical lines show our cutoff oi Zp = zi -\- Q.l; fortu- 
nately, as shown, this limitation on the photometric red- 
shifts means that even just using photometric redshifts di- 
rectly, without accounting for the errors, is less likely to 
cause errors. Furthermore, as shown, the errors in E~^ for 
Zp ~ zi -\- Q.l are downweighted in importance due to our 
weighting scheme, with weights oc E^'^. Consequently, we 
can expect to see some cancellation, whereby the large in- 
crease in E^^ (decrease in signal) for small Zp — zi, which 
has small weight due to the weighting scheme, is cancelled 
by the modest decrease in E~^ (increase in signal) for large 
Zp and zi, which get considerable weight. The extent of the 
cancellation will be shown in H6.3I 

The difference in results between the best-fit Gaussian 
error distribution versus the actual error distribution is quite 
small, except for the range Zp ~ zi that is not used in this 
work; probably, the best-fit distribution gives higher values 
of EJ^ in this range because the fit distribution underesti- 
mates the tails that are present in the actual distribution, 
and the tails help to lower E~^ with the actual error distri- 
bution for Zp ^ zi. This result implies that when restricting 
to the range Zp > zi -\- 0.1 as in this work, it is unimportant 
whether or not the method of parametrizing the photometric 
redshift error distribution is able to account for the tails, as 
long as the basic shape (mean and width) is correct. How- 
ever, for analyses that include lower values of Zp, a more 
careful treatment of the error distribution is necessary. 

It is clear from these results that for different lim- 
its on source photometric redshift (e.g., without requiring 
Zp > zi +0.1), the results of photometric redshift errors will 
vary, and must be re-evaluated if we choose to relax this 
restriction. 

As illustrated by this discussion, the effect of photo- 
metric redshift errors is complicated, with several possible 
effects that will tend to push the lensing signal in opposite 
directions, and it is difficult to anticipate which effects will 
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Figure 9. J]'^^ (zi, Zs) for several values of lens redshift. For 
each case, a line showing our cutoff at Zp = z; + O.f is shown. 
As labeled on the plot, SJ^ was computed four different ways: 
using photometric redshifts directly; using them corrected for the 
conditional bias (5) only; and with corrections for the actual and 
the best-fit Gaussian photometric redshift error distribution. 

turn out to be more important than others for any given 
lens and source sample. Consequently, we must turn to the 
systematics test described in fj^to determine the effects of 
photometric redshift errors on AE. 

We also must consider the effects of errors in the photo- 
metric redshift error distribution. Statistical (Poisson) error 
can be determined via a bootstrap resampling of the DEEP2 
matches. The effects of LSS and systematic error are more 
complex, because correlations between photometric redshift 
errors are more difficult to model than the correlations be- 
tween the redshifts used to derive p[z). There are several 
concerns: statistical errors on photometry; systematic errors 
on colors; and deviations of galaxy SEDs from templates. For 
example, if the colors of galaxies depend on environments, 
we may expect different photometric redshift error distribu- 
tions for field galaxies versus those in groups or clusters. 

6.2 Error comparison 

Because we used three methods to compute error estimates 
on the signal (the analytic expression for the error, the ran- 
dom catalog errors, and the bootstrap errors), we compare 
the results of the three methods to ensure that they gave 
comparable results. In general, the results of the three meth- 
ods were quite comparable, with the exception of two situ- 
ations. First, for the lower luminosity bins, the bootstrap 
and random catalog errors were larger than those computed 
analytically by 15-20 per cent. Second, for the higher lu- 
minosity bins, the radial bins that required a sizable boost 
had larger errors computed using the bootstrap than via 
the other methods. Because there is some uncertainty on 
the boost factors (which was also computed using the boot- 
strap), the uncertainty on the product B(r)AE(r) is approx- 
imately B(r)AE(r)v'((crs/-B)2 + (crAE/AE)^) when com- 



puted using the bootstrap, raising the fractional error by 
roughly 5 per cent at small pair separations. The larger er- 
rors from the bootstrap are more likely to be correct since 
they take into account errors on the boost factor itself, so 
these are the errors that were used for the computation of 
ratios Ri^a,i3- 

The analytic errors are essentially noiseless since they 
are computed over millions of lens-source pairs. The random 
catalog errors had the most noise, since they were computed 
with only 24 random catalogs due to the computational ex- 
pense involved. If the error amplitude E{r) is fit to a power- 
law Ar~^, and the noise amplitude is defined as the frac- 
tional difference between the actual errors and the best-fit 
value, then the random catalog errors typically had a noise 
amplitude of 15 per cent. The bootstrap errors have little 
to no noise, with a maximum noise amplitude of 5 per cent 
even for LI and L6 (which had the fewest lenses) and 1 per 
cent or lower for the others. 

We also checked the covariance between the signal in 
different radial bins using the bootstrap for all luminosity 
bins and source samples. We can anticipate that these cor- 
relations may become important on angular scales for which 
the lens-source separation is comparable to the lens-lens sep- 
aration, since on those scales a given source will contribute 
to the calculation for more than one lens. We find a typical 
lens-lens separation to be 7', which corresponds to comoving 
transverse separation of 200 h^^kpc in LI, and as high as 
1100 h~^kpc in L6. Using the bootstrap resampling for 2500 
datasets, we find that for LI, there are correlations as high as 
0.2-0.3 starting at radii of 800 ^~^kpc between nearby radial 
bins (generally the 3 nearest radial bins in either direction) 
but little covariance between the bins for lower radii. The 
bootstrap procedure naturally introduces noise with stan- 
dard deviation 1/y/M (M is the number of bootstrap re- 
gions) in the correlation coefficients when they are <^ 1 (the 
noise is less for higher correlations), so the statement that 
there is little covariance means that the correlation coeffi- 
cients were consistent with being drawn from a Gaussian 
distribution N{0,1/A'I). For L6, no covariance is observed 
within the noise, even for the outermost bins that should 
be most correlated, since these correspond to a smaller an- 
gular scale than for the fainter (more nearby) lenses. Note 
that our procedure described in !JK| automatically takes any 
covariance between the radial bins into account because it 
uses the bootstrap. 

We also found that our measure of the signal in a given 
bootstrap-resampled dataset, AE as defined in ^ has a 
significant covariance between luminosity bins. This result 
is unsurprising considering that many of the same sources 
contribute to the result in different luminosity bins, though 
the weighting may differ significantly. We found correlations 
between the results in different luminosity bins as high as 
0.30 for adjacent luminosity bins, 0.10 for luminosity bins 
that were separated by one (e.g. L4 and L6) , and consistent 
with zero for further separations. This kind of correlation 
must be taken into account (as it is with the bootstrap) for 
any analysis that utilizes the results in several luminosity 
bins simultaneously. 

As an additional test, we also examined the distribution 
of values of AE for each luminosity bin and source sample, 
computed from the 2500 bootstrap-resampled datasets. This 
distribution was found to be Gaussian to high accuracy. The 
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Gaussianity is quantified via calculation of the skewness 

^ _ {(AS-JAS)f) 
^ (Var(AE))i-5 

and kurtosis excess 

{(AS-{AS))^) 

Sa = — 6 

(Var(AE))2 

both of which should be zero for a Gaussian distribution 
within the noise. For M bootstrap subregions, and re- 
sampled datasets, the expected variance of the skewness 
is Var(s3) = 6(M-i + N-^) « 6/M (M = 250 and 
N = 2500 in our case) a nd of the kur tosis is Var(s4) = 
24(M-i + iV-^) ^ 24/M iCramerlll94 '6l). The values of S3 
and S4 calculated for a number of different sources samples 
and luminosity bins were consistent with being drawn from 
random distributions with mean zero and approximately 
those variances, and therefore consistent with Gaussianity. 

6.3 General systematic tests 

In this section, we present results of tests for systematics 
described in f^l 

6.3.1 Random points test 

As mentioned in H4.ll we found a nonzero signal computed 
around random points in the survey region, indicating the 
presence of a systematic shear 7sys in the source catalog 
that must be accounted for. The shape and amplitude of 
this signal varies with luminosity bin and source sample, 
because it is related to angular scale (and therefore difi'erent 
transverse separations for lens samples at different redshifts) 
and likely varies with source R2 as well. 

In principle, if the systematic shear is uniform across 
the survey, then it will only be revealed on large scales. If 
the distribution of sources around lenses is circularly sym- 
metric, then the contribution of the systematic shear to the 
signal averages out, so only edge effects on large scales will 
show the presence of systematic shear. The effect of this 
spurious ellipticity is most noticable for the narrow south- 
ern stripes, of approximate width 2.5 degrees; our decision 
not to use them for the work in this paper reduced the ob- 
served random catalog signal by approximately a factor of 
four. However, it is unlikely that the systematic shear is per- 
fectly uniform, and if it fluctuates on small scales then we 
may also see a random catalog signal at small transverse 
separations. 

As an illustration, fi gure lTHl shows the mean random cat- 
alog signal for all 6 luminosity bins, computed with r > 21 
sources. As shown in this figure, there is a clear signal from 
the random catalogs at large radius; the statistical error due 
to this signal can be taken into account by the bootstrap. 
For the brighter sources, r < 21, the random catalog signal 
has slightly different shape, and is shown in figure 1111 As 
shown there, the signal is zero at low transverse separation, 
then becomes positive in the fainter bins, decreasing to zero 
at larger radii. While this occurs at different radii for L1-L3, 
the corresponding angular scale is same: roughly 25'. This 
number is of significance in the SDSS as the separation be- 
tween adjacent camcols; however, the reason it is showing 
up here is not entirely clear. For the characteristic redshift 
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Figure 10. Average signal from 24 random catalogs, with r > 21 
sources, and the dashed line showing AS(r) = for reference. 
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Figure 11. Average signal from 24 random catalogs, with r < 21 
sources, and the dashed line showing AS(r) = for reference. 



of L4, 25' corresponds to 2.2 /i~^Mpc, which is why the sig- 
nal does not go to zero in that bin; however, it is clearly 
declining by the maximum radius of 2 /i~^Mpc. 

If the systematic shear is constant across the survey, 
then one way to eliminate or at least reduce it would be to 
only keep lenses that have a reasonably circular distribution 
of sources around them, since the systematic shear along 
the scan direction will then cancel out of the calculation of 
AE. We implemented this cut, requiring the ellipticity of 
the distribution of sources around the lens to be less than 
0.2, but found that this cut did not significantly decrease 
the random catalog signal; consequently, we conclude that 
the systematic shear ffuctuates on a scale smaller than the 
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typical distribution of sources around a lens, and did not use 
this cut for the work in this paper. 



6.3.2 45- degree test 

In this subsection we describe results of the 45-degree test 
described in H4.2I This test was performed using the boot- 
strap with 2500 subsamples. In order to check for a system- 
atic dependent on transverse separation, we did the test in 3 
radial bins: 30-100 /i"^kpc, 100-600 /i"^kpc, and 600-2000 
/i~^kpc, as well as for the full range 30-2000 ft^^kpc. The 
test was done for each lens luminosity sample separately, 
and the results were also checked for the combined result 
averaged over lens luminosity. 

Note that the 45-degree random catalog signal was 
nonzero for all lens-source subsamples, and therefore had 
to be subtracted just like for the non-rotated signal. Un- 
like for the real signal, rather than averaging AE = 
(AS(r/l/i"^Mpc)°-*^), we average AS45 = (AE45) since we 
do not expect this quantity to vary widely over the radial 
range used. We checked the error distribution for AS45 with 
the bootstrap and, since it was found to be Gaussian, use a 
simple test to determine the statistical significance of the 
45-degree signal. Since the test involves using just a single 
variable (AE45 averaged over a given radial range), we use 
the distribution with a single degree of freedom to calcu- 
late p-values, the probability for AE45 to exceed the given 
value by chance assuming that it really is consistent with 
zero. Problems that cause a pure calibration bias alone have 
no effect on p-values since they change both the amplitude 
of the signal and the errors. 

Results for the 45-degree test for all source samples and 
radial ranges are shown in table ll2l averaged over luminosity 
bins. As shown there, we have no evidence for a 45-degree 
signal, which indicates that we are not dominated by sys- 
tematics, since most contribute to the 45-degree signal on 
the same level as the non-rotated signal. Results for each lu- 
minosity bin are not shown, in order to simplify the presen- 
tation of results, but we also have no evidence of 45-degree 
signal in individual luminosity bins. Fig. 1121 shows the 45- 
degree signal averaged over luminosity for each source sam- 
ple. 



6.3.3 Star-galaxy separation 

As described in H4.3I we have good reason to believe that 
stellar contamination in the source catalog, even at faint 
magnitudes, is minimal (~ 2 per cent for r > 21). However, 
we tested this hypothesis by computing signal for lenses at 
low galactic latitude, defined as sin 6 < 0.7, and compared 
the results against the signal at high galactic latitude. In 
principle, stellar contamination can cause the signal at low 
galactic latitudes to be lower than the signal at high galactic 
latitudes. For this test, in order to maximize signal-to-noise, 
the signal was computed using lenses in L3-L6 combined, 
since most of the signal comes from those bins. This process 
was done for both the r < 21 and r > 21 sources separately, 
where we expect most stellar contamination to be in the 
r > 21 sample. 

The results of the test are shown in table [T^ As shown, 
while the signal in the low galactic latitude sample is lower 



Table 12. AE45, its standard deviation (T45, and the probability 
to exceed the measured value by chance if it is truly consistent 
with zero, for all source samples and radial ranges. The results 
are shown for the signal averaged over lens luminosity. Because 
the r > 21 sample includes some LRGs, the results shown for 
those two samples are not independent; the results for all source 
samples combined take their covariance into account. 



Radial range {h ^kpc) 


AS45 (hMQ/pc^) 


a'45 


p- value 


r < 21 


sources, without LRGs 






30 < r < 100 


1.70 


2.45 


0.49 


100 < r < 600 


-0.03 


0.45 


0.96 


600 < r < 2000 


0.38 


0.24 


0.12 


30 < r < 2000 


0.33 


0.22 


0.13 


r > 21 sources 


30 < r < 100 


-1.72 


3.04 


0.57 


100 < r < 600 


-1.17 


0.55 


0.04 


600 < r < 2000 


-0.13 


0.25 


0.62 


30 < r < 2000 


-0.25 


0.23 


0.26 


LRGs 


30 < r < 100 


6.37 


5.31 


0.23 


100 < r < 600 


-0.17 


0.91 


0.85 


600 < r < 2000 


-0.21 


0.45 


0.64 


30 < r < 2000 


-0.19 


0.42 


0.66 


All 


30 < r < 100 


1.44 


1.90 


0.45 


100 < r < 600 


-0.47 


0.34 


0.17 


600 < r < 2000 


0.09 


0.18 


0.63 


30 < r < 2000 


0.02 


0.17 


0.88 


AE^g, averaged over luminosity 
















r [kpc/h] 

Figure 12. 45-degree rotated signal for each of the three source 
samples, averaged over all luminosities. Many radial bins are aver- 
aged to reduce the noise and make the plot easier to understand. 
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Table 13. Results of the systematics test for various subsamplos, including non-Gaussian 68 per cent and 95 per cent confidence 
intervals. In all cases shown here the lenses are from luminosity bins L3-L6. 



Sources ct 


Sources /3 




68% CL 


95% CL 








lower 


upper 


lower 


upper 




Stellar contamination test, ^6.3.31 












r < 21 without LRGs, sinb < 0.7 


r > 21 without LRGs, sin 6 > 0.7 


0.86 


0.78 


0.94 


0.70 


1.03 


r > 21, sin 6 < 0.7 


r > 21, sin 6 > 0.7 


0.92 


0.84 


1.01 


0.76 


1.11 




Seeing comparison, i6.3.4l 












r < 21 without LRGs, bad seeing 


r < 21 without LRGs, good seeing 


0.99 


0.92 


1.08 


0.85 


1.17 


r > 21, bad seeing 


r > 21, good seeing 


1.00 


0.92 


1.09 


0.84 


1.19 




R2 comparison, i|6.3.5l 












r < 21 without LRGs, R2 < 0.55, r-band 


r < 21 without LRGs, R2 > 0.55, r-band 


1.11 


1.02 


1.20 


0.94 


1.30 


r < 21 without LRGs, R2 < 0.55, i-band 


r < 21 without LRGs, R2 > 0.55, i-band 


1.12 


1.04 


1.21 


0.96 


1.30 


r > 21, R2 < 0.55, r-band 


r > 21, R.2 > 0.55, r-band 


0.89 


0.80 


0.99 


0.71 


1.10 


r > 21, R2 < 0.55, i-band 


r > 21, R2 > 0.55, i-band 


1.24 


1.13 


1.37 


1.03 


1.50 




r versus i band comparison, ilti.3.6l 












r < 21 without LRGs, j-band 


r < 21 without LRGs, r-band 


1.06 


1.00 


1.12 


0.95 


1.18 


r > 21, j-band 


r > 21, r-band 


1.04 


1.00 


1.07 


0.97 


1.11 



than that in the high galactic latitude sample, the signals 
are statistically consistent with each other for r < 21 and 
r > 21 cases. This result, combined with the results from 
GOODS shown in H4.3I supports our assertion that stellar 
contamination is negligible in our catalog. For 2a bounds on 
the error in the signal due to stellar contamination, we use 
the results from GOODS with the binomial distribution as 
in 

6.3.4 Seeing dependence of calibration 

As described in >14.4I we also used the systematics test to 
compare the signal computed with sources that had PSF 
sizes greater than and less than the median. The results 
of the test using L3-L6 lenses are shown in table 1131 As 
shown there, the signals are statistically consistent with each 
other, indicating that we do not have to worry about seeing- 
dependent calibration of the shear. 

6. 3. 5 R2 dependence of calibration 

As described in H4.5I we used the systematics test to com- 
pare the signal computed with sources with R2 < 0.55 versus 
with R2 > 0.55. This exercise was done in both the r and 
i bands separately, and results are shown in table 1131 As 
shown there, there is no evidence at the 2g level for a sys- 
tematic calibration offset that depends on resolution factor 
R2. 

This result is important because recent tests have in- 
dicated that the source catalog used in H04 did have an 
-R2-dependent calibration, with the calibration changing by 
as much as 50 per cent from the lowest R2 = 1/3 to the 
highest. When averaging over all r < 21 sources, this effect 
caused the calibration for that paper to be too low by ap- 
proximately 15 per cent (only r < 21 sources were used for 
that work; since the LRG sample and the r > 21 sample is, 
on average, at lower R2, the offset is even worse for those 
samples). Tests to determine the source of the problem were 
not definitive, but seemed to rule out selection biases. The 



basic conclusions of that paper, namely the 99.9 per cent 
stat-|-sys confidence intervals on the intrinsic shear, are es- 
sentially unaffected by this problem, since we assumed up 
to 20 per cent shear calibration bias when computing the 
confidence intervals. However, this finding highlights the im- 
portance of our results here, which is that this catalog does 
not have a statistically significant _R2-dependent calibration. 



6.3.6 Systematic differences between bands 

While for most tests we used the shape measurements aver- 
aged over both r and i bands, we also compared the signal 
computed using only one band versus the signal computed 
using the other. This test was done separately for r < 21 
and r > 21 sources. As shown in table [T^ there is no sign 
of a systematic discrepancy between the signal computed 
using the shape measurements in either band individually, 
so to improve statistics we will henceforth use the shape 
measurement averaged over bands. 



6.3.7 Boosts 

The boost factors Bi^a{r) generally follow expected trends, 
decreasing with radius and increasing with luminosity. Fur- 
thermore, they are largest with the r < 21 sources, which 
have the largest overlap in redshift range with the lens sam- 
ple; less with the r > 21 sources; and the smallest with 
the LRG sample, since those are chosen specifically to avoid 
contamination from low-redshift [z < 0.35) galaxies. 

A systematic effect at the smallest radii (r < 30 
h^^kpc) is revealed by the boost factors. While they should, 
in principle, be monotonically decreasing, we found that for 
the brightest lens samples, they actually increase from 20- 
30 /i~^kpc, then follow the expected trend of decreasing for 
r > 20 /i~^kpc. In H04, this trend appeared for even larger 
radii; however, for that work, our source catalog did not 
include deblended children, so this result is expected. This 
source catalog does include deblended children, so we do not 
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stars as lenses, LRG low surface-brightness sources 




100 1000 10 100 

r [kpc/h] 8 (arcsec) 



Figure 13. The top plot shows B{r) for r > 21 sources and all 
luminosity bins. The bottom shows B{r) for L5 with all source 
samples as labeled on the plot. 



Figure 14. Correlation function between bright stars and the 
high-redshift LRG sample, for low surface-brightness LRGs only, 
as a function of star apparent magnitude. 



expect a loss in lens-source pairs for small transverse sepa- 
ration due to the loss of deblended children. However, very 
bright galaxies, for which this effect is noticable, are large 
enough that at those separations, their hght could make it 
impossible to detect much smaller, faint galaxies. This could 
explain the fact that this effect is more noticable for the 
fainter r > 21 sources than it is for the r < 21 sources. In 
order to avoid strange selection effects due to this problem, 
this work only uses r > 30 /i~^kpc for the analysis. 

Plots of the boost factor are shown in Fig. 1131 The top 
plot shows the trends of the boost with luminosity bin for 
the r > 21 sources; the bottom shows the variation of boost 
with source sample for L5 lenses. The problem with B{r) at 
r < 30 /i~^kpc is clearly visible in this plot, especially for 
L6. 

A close examination of the boost factors reveals a sys- 
tematic effect at small transverse separations that affects 
any analysis of galaxy-galaxy weak lensing done using SDSS 
data. From plots of the boost factor at small transverse sepa- 
ration computed using LRGs as sources, it became apparent 
that there is a deficit in lens-source pairs below a certain ra- 
dius. This deficit manifests itself in the boost factor actually 
dipping below 1 (i.e., < 0), which is not physical. This 
deficit also is present for the r < 21 and r > 21 samples, 
though it is harder to notice because of the fact that there 
are physically-associated pairs at the affected radii. We de- 
termined that this effect is due to either an instrumental 
effect or something in the Photo pipeline (rather than due 
to some selection effect in our catalog, error in signal com- 
putation, or actual physics) due to the following factors: (1) 
it occurs at a particular angular (not physical) scale, most 
noticably around 50" though it extends up to about 90" , (2) 
the effect is noticable with two different catalogs, the one 
from this work and from H04, which were processed with 
completely independent software pipelines from the Photo 
outputs, (3) it is most noticable for lenses that are bright in 
apparent magnitude, (4) it is most noticable for low surface- 



brightness sources, and (5) we see the effect even if we com- 
pute the signal around bright stars instead of around galax- 
ies. While a hint of this effect can be seen for B(r) from 
LRGs in Fig. 1131 it is more obvious in Fig. 1141 which shows 
a plot of the correlation function between bright (r < 19) 
stars and high-redshift LRGs, where there is a non-zero cor- 
relation function at small separations because of low-redshift 
quasar contamination in the star sample. This plot was com- 
puted using only low surface-brightness sources (those with 
a value below the median), and is shown as a function of 
star magnitude. 

The effect turns out to be due to a problem in deter- 
mination of the sky level around bright objects (stars or 
galaxies). The sky level is determined in overlapping boxes 
of size 256 pixels (approximately 100") and linearly inter- 
polated on the 128 pixel (50" scale), using a 2.3o" clipped 
median® . We expect the sky level near bright lenses (within 
about 1') to be higher than the global sky because Ught from 
the lens galaxies extends out to a large angular separation, 
and we do not want that extra light to be included in the 
flux from the sources in that region. So, we do want the sky 
level to be influenced by the presence of the bright objects. 
However, our results suggest that the sky level determina- 
tion is overly influenced by the presence of bright objects, 
with the sky level being set too high near them. The problem 
is different for each band. This change in the sky level has 
several effects: (1) some sources are not detected at all by 
the Photo software, (2) some are detected, but due to the 
faulty sky level, the magnitudes, colors, and shape measure- 
ments determined by model fits to the hght profile are off, 
and therefore they are not included in our source catalog 
because of magnitude or i?2 cuts or OBJC_TYPE failure, 
or are included but with these quantities computed wrong. 



° For more information about the sky level determination, see 
_Attp://www.sdss.org'/ dr3 / algorithms / sky. html 
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and (3) our computed redshift distribution of these sources 
is likely wrong because of the incorrect fluxes. Furthermore, 
because this effect can vary from band to band, the colors 
are incorrect as well, affecting both photometric redshift de- 
termination and selection of the high-redshift LRG sample. 

To test that this effect is indeed the cause of the ob- 
served effect, we used the Photo reductions to compute the 
averaged sky level as a function of angular separation from 
bright galaxies (we except the effect to differ around stars 
versus galaxies, because galaxies are extended and there- 
fore affect more pixels). This computation was done in each 
band, and we verified that the sky flux level f^ky around 
bright galaxies is indeed higher than the baseline value com- 
puted around faint galaxies, by a quantity we call As^j, in 
each band. For such small angular separations, we expect 
that fsky should be higher than that baseline, so much of 
the rise at small angular separations is not spurious. Fig. 1151 
shows a plot of the average sky flux in the r band as a func- 
tion of angular separation from lens galaxies with apparent 
magnitudes as labeled on the plot. We can make a naive 
calculation of the effect of this problem (assuming that all 
of Asfcy is spurious and that the only effect was to change 
the surface brightness of the faint source without changing 
its size) at 40" for a circular source galaxy 4" in diameter at 
magnitude 21.5. The flux for such an object is 2.5 nanomag- 
gies, and since its area is 12.5 square arcsec, then if the 
lens is at r ~ 16, with Asky ~ 0.4 nanomaggies per square 
arcsecond, its true flux should have been 5.0 nanomaggies, 
a full factor of two higher, giving it a true magnitude of 
20.8, a highly significant offset from the observed value. For 
lenses around r ~ 17.5, a large fraction of the sample, Asky 
is smaller, but we still expect a "true" magnitude of 21.0. 
Of course, this is just a crude estimate, and a more careful 
calculation of what happens to the fits to the light profiles 
when the sky level is changed is necessary. This calculation 
is actually quite complicated because the change in sky flux 
changes the effective radius of the source from the fits, so not 
only is the flux underestimated within the area of the galaxy, 
light is also lost due to the underestimated size. However, 
when we tried applying this naive correction, we found that 
we overcompensated for this effect, so it seems clear that 
some of Asky is not spurious (as would be expected for such 
large, bright lenses, for which some of the light does indeed 
extend to these radii). 

There are several implications of this effect for our work. 
First, while figure lTH shows what looks like a dip in the boost 
factor below 90" and most prominently at 50" , with it then 
rising at lower separations, our findings suggest that the 
boost factor is actually underestimated for all radii below 
the 50" level, though it is impossible to actually see this 
effect due to the physically-associated pairs. (Note, however, 
that if the problem is due to the lens galaxies light profile 
falling off faster than the linear interpolation scale of the sky 
level, then the effect should be worst around the 50" scale 
and, if anything, somewhat better at smaller separations.) 
It seems that we have lost pairs either because the sources 
were not identified by Photo at all, because their fluxes and 
sizes were miscalculated so they failed our magnitude, R2, 
or OBJC-TYPE cuts, or (for the high-redshift LRG sample) 
because their colors were wrong. Furthermore, for the pairs 
that were included, we must be concerned about incorrect 
shape measurement and redshift distributions. In general. 



Average sky flux (r-band) around galaxy lenses 
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Figure 15. Average sky flux in nanomaggies per square arc- 
second as a function of angular separation from bright galaxies, 
computed in tfie r band. Tfie lowest line, tfie sky flux around very 
faint galaxies, is shown as a reference, and is flat, as expected. 

this error will tend to lower the measured signal, since the 
underestimated boosts lower the signal, and the effect of 
underestimating the flux is, on average, to make us think the 
galaxies are fainter and more distant, overestimating 
and underestimating AE. Since at 50" we see a 5 per cent 
effect, and it may be worse at smaller 6, though likely not 
much worse since the problem is due to the interpolation, we 
place a 2a bound on the error due to this effect of [—0.15, 0], 
but we emphasize that more study is necessary to quantify 
this effect more precisely. 

We attempted to correct for this effect by excluding 
regions near objects (stars or galaxies) with saturated pixels, 
or about 1.5 per cent of the sample, and regions within 60" 
of r < 16 galaxies, or about 8 per cent of the sample.^ 
However, with these restrictions, we still see the effect at 
approximately the same level as before. This fact indicates 
that most of the problem is due to the r > 16 galaxies that 
comprise the majority of our lens sample rather than due 
solely to the very brightest subsample of lenses. 

In future work, we will attempt a more detailed mod- 
eling of this problem. For most of our results, we do not 
attempt to correct the boost factors, which are clearly un- 
derestimated, though we will show what happens to the re- 
sults if we try a naive correction; since only small separations 
are involved, the change in the signal averaged over radius 
due to this effect is small. Furthermore, while we have esti- 
mated rather large values for the magnification bias at small 
separations for the r < 21 and LRG samples, this sky level 
problem makes it effectively impossible to make any detec- 
tion of magnification bias with the SDSS using MAIN spec- 
troscopic sample lenses and working at angular separations 
of less than approximately 90" . 

The final test performed in this section was to compare 

^ We thank Ryan Scranton for providing the necessary masks 
and mask utilities to allow us to perform this test. 
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the signal computed for the bright and faint halves of the 
brightest bin (L6) separately and then averaging them (i.e., 
taking the average value of B{r)[/S.T.{r) — /S.Tirand{r)\) versus 
the signal computed by averaging over the whole luminosity 
bin, as we usually do, to get the average values of B(r) and 
AE(r) — AEraridl?"). The concern is that due to the non- 
uniformity of the boost factor, with both the boost factor 
and signal rapidly increasing in this bin, our averaging pro- 
cess may bias the signal. Our tests indicate that for the both 
r < 21 and r > 21 sources, the effect of this averaging is less 
than a 1 per cent effect even for L6; consequently, it must be 
negligible for the other bins, for which the signal and boosts 
are varying less rapidly. 

6.3.8 Intrinsic Alignments 

When computing the expected contamination due to intrin- 
sic alignments, we must do the computation separately for 
each luminosity bin i and source sample a because the con- 
tributions may vary with lens mass (and therefore luminos- 
ity) and with source sample (more distant source samples 
will have fewer physically associated lens-source pairs). Fur- 
thermore, the procedure should be done as a function of 
radius, since limits on the intrinsic shear A7i„t vary with 
radius. We split the data into 3 radial ranges j with lim- 
its rmin,i and Vmax,]. This estimate assumes that the boost 
factors B{r) are entirely due to physically associated pairs, 
and ignores the magnification bias estimates in table |S| 

The signal contamination due to intrinsic alignments 
can be computed as follows. First, we determine the fraction 
of lens-source pairs that are physically associated. For each 
lens sample i, source sample a, and radial bin j, we compute 



/i''"'(^.) 



Y.l:Z^'{B..^{r)-l)A{r)ntl\{r) 



(46) 



where A{r) is the area of the annular bin centered at r, and 
the value of fc is determined via the bootstrap. 

Then, for a given radial range, we can compute the es- 
timated contribution of intrinsic alignments to the lensing 
signal via 



AE, 



(r.)=/i''"^(r,) 



(47) 



In order to avoid excessively low (or even non-physical 
negative) values of fc, we had to correct for the sky level ef- 
fect that suppresses B{r) at small radii. This correction was 
done as follows. For the LRG sample, for which this effect is 
most noticable, we examined B(r) for all lens samples. We 



found the minimum value -B„ 



at r = r„ 



and assumed 



that the effect was actually the same at all r < rmin (and 
could not be observed because B{r) > 1 for those radii). So, 
for r ^ rmin, we multiplied the signal and B(r) by 1/Bmin', 
for r > rmin, we multiplied the signal and B{r) by 1/B{r) if 
B{r) < 1, and by 1 otherwise. This correction is very rough, 
since it assumes that the sky level effect is the same for all 
source samples, and that the sky level effect only changes 
the boosts without having an effect on the shear or redshift 
calibration. However, our estimate of contamination from 
intrinsic alignments in table [Til is still conservative, because 
we ignore the fact that magnification bias contributes to the 



boost factors at small radii. For the purpose of this calcu- 
lation, the 99.9 per cent stat-|-sys confidence intervals from 
H04 have been reduced by a factor of 0.6 so they will be 2a 
rather than 3.4a bounds. 

For this calculation, we assumed that A"/int for the r > 
21 and LRG samples is the same as that for the r < 21 
sample. While this assumption is not fully justifiable, and 
means that the actual estimates of the contamination should 
not be taken too seriously, the size of the confidence intervals 
are likely still trustworthy. 

Here we discuss the results in table ll4l The entries with 
zbcxD as bounds are those for which H04 could not place any 
constraint on A^int. As shown, because of the poor con- 
straints at low luminosities (due to the fact that the contam- 
ination fractions fc are so low that there were very few lens- 
source pairs with which to study the intrinsic alignments), 
no strong constraints on the contamination of signal due to 
intrinsic alignments could be placed in H04. However, the 
fact that the average signal (AE) for these ranges is entirely 
reasonable suggests that the contamination by intrinsically 
associated pairs is not causing a large problem. For r < 21, 
use of p{z) rather than photometric redshifts leads to very 
large contamination fractions fc and therefore very loose 
constraints on AEi„i. The fact that the contraints on in- 
trinsic alignments for r<21,r>21, and LRG samples are 
quite different, generally tightest on LRGs and loosest on 
r > 21, yet the results agree quite well (as will be shown 
in H6.4.4II suggests once again that we do not have a ma- 
jor problem with contamination by intrinsic alignments. For 
small radii, the r < 21 (with Zp) and r > 21 samples, we 
assign 2a uncertainties due to intrinsic alignments of ±20 
per cent for L3-L5 and ±60 per cent for L6; for LRGs, ±5 
per cent for L3-L5 and ±15 per cent for L6. For large radii, 
the r < 21 (with Zp) sample, we assign 2a uncertainties of 
±5 per cent for L3-L5 and ±15 per cent for L6; for r > 21, 
±15 per cent for L3-L5 and ±25 per cent for L6; and negligi- 
ble for LRGs. These constraints were assigned by comparing 
the 2a constraints on AEi„t to the characteristic AE in the 
table. 



6. 3. 9 Corrections for non-volume limited sample 



As described in H4.9I we must consider the effects of dif- 
ferences in the mean weighted luminosity of the subsample 
when computed with different source samples or redshift 
distributions for the same sample. Using a simple model for 
AE(Z/), we can compute the expected change in the signal 
from Eq. 1331 and expected ratios 



■i,(y.j3 



i± 



^(AE) 
AE ■ 



(48) 



We have no need for these corrections for LI and L2, 
since for those samples all source samples are essentially 
at infinite redshift, and SL is exceedingly small. Using our 
results for L3-L6, we find AE'(L)/AE(L) = 0.19, 0.43, 0.48, 
and 0.33 respectively. 

We must consider several types of corrections. First, we 
consider signal comparisons for r < 21 sources, with differ- 
ent methods of determining E^^. The maximum variation 
in luminosity SL found when comparing among the methods 
is 0.0023, 0.0028, 0.0046, and 0.0027 for L3-L6 respectively, 
which gives expected ratios of 1.0004, 1.0012, 1.0022, 1.0008, 
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Table 14. Estimated contamination from intrinsic alignments for each lens luminosity and source sample at the 2a level; the average 
signal in that radial range, (AS), is shown for comparison. 
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-1+21 
1-30 


L6 


[100, 300] 


70 


0.17 


Q + 8 


0.58 


16lti 


0.23 


c:+12 
0-15 


0.021 


0.3+n 


L6 


[300, 992] 


23 


0.056 


n + 3 
^-2 


0.14 


7+10 
'-9 


0.062 


2+" 
^-3 


0.003 


•J-i-o.i 



far too small to be detected considering our statistical er- 
rorbars; consequently, we do not apply any correction when 
comparing between the various methods of computing 
for the r < 21 sample. Next, we consider comparisons within 
the r > 21 source sample, different methods of comparing 
E"^. The maximum variation in luminosity 5L is 0.0017, 
0.0033, 0.0041, and 0.0017, again giving expected ratios sim- 
ilar to those for r < 21, far too small to measure. For com- 
parisons within the LRG sample, there was no detectable 
difference in the weighted luminosity since the source red- 
shift distributions varied very little and are all essentially at 
infinity. 

The final test is the comparison of the luminosities be- 
tween the different source samples (r < 21, r > 21, and 
LRGs) to ensure that the ratio test should not have some 
correction due to their difi'erent weighted luminosities. In 
general, the weighted luminosity of the r < 21 sample was 
lowest, and the LRG sample was highest. We find maximum 
SL of 0.0092, 0.016, 0.016, and 0.006 (comparing LRG to 
r < 21), giving expected ratios of 1.002, 1.007, 1.006, 1.002. 
Given that these ratios represent a difference in overall cal- 
ibration of less than 1 per cent, much lower than our 2a 
bounds on the calibration, we find that we need not apply 
any correction for the change in mean weighted luminosity 
when comparing the signal from different source samples. 

6.4 Redshift systematic tests 

In this section, we describe our application of the system- 
atics test to assess which methods of redshift distribution 



determination are optimal. First, for each group of sources 
(r < 21 without LRGs, r > 21, and LRGs), we compared 
the various methods of determining the redshift distribution. 
Next, we compared the signal determined using each of these 
three samples, to make sure that our results are consistent. 
In each case, this test was done individually for each lumi- 
nosity bin, because we can expect that different lens redshift 
samples will sample difi'erent parts of the source redshift 
distribution, and therefore results may differ systematically 
across the luminosity bins. However, we also use the results 
averaged over luminosity bins for greater statistical power. 
Plots of the signal will be shown in H6.4.4I 

All errorbars shown in this section are statistical er- 
rorbars from the computation of signal only; they do not 
include systematic error on the shear or redshift distribu- 
tions, or statistical error in the determination of redshift 
distributions or photometric redshift error distributions. 



6.4-1 Bright (r <21) sources 

For the r < 21 sources, several methods of redshift distribu- 
tion determination are compared: 

1. The redshift distribution p{z\r) from COMBO-17, 

2. The average redshift distribution p{z) from DEEP2, 

3. Photometric redshifts from kphotoz directly, 

4. Photometric redshifts, corrected for the conditional 
bias {S{zp)) only, 

5. Photometric redshifts, corrected for the best-fit error 
distribution, and 
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Table 15. A comparison of the overall calibration of the signal 
when computed using the 6 methods considered in this section for 
the r < 21 sources, for 30 < r < 2000 /i~^kpc, all luminosity bins 
with significant signal. The ratios Ri^a,f3 are given along with the 
95 per cent confidence interval. 



Sources a Sources (3 


Ri,a,f3 




L3 


L6 


L3-L6 



DEEP2 


Zp 


0, 


,89: 


DEEP2 


Zp, fit p{5) 


0, 


.87; 


DEEP2 


Zp, bias 


0, 


.92; 


DEEP2 


Zp, p{S) 


0, 


.89; 


DEEP2 


C0MB017 


1, 


■oi; 


Zp 


Zp, fit p{S) 


0, 


.97; 


Zp 


Zp, bias 


1, 


.03; 


Zp 


Zp, p{S) 


1, 


■oo; 


Zp 


C0MB017 


1, 


.13; 


Zp, fit p{5) 


Zp, bias 


1, 


.06; 


Zp, fit p(<5) 


Zp, p{5) 


1, 


.02; 


Zp, fit p((5) 


C0MB017 


1, 


.16; 


Zp, bias 


Zp, p{5) 


0, 


.97; 


Zp, bias 


C0MB017 


1, 


.lo; 


Zp, p{S) 


C0MB017 


1, 


.14; 



+0.16 
-0.13 
+0.15 
-0.13 
+0.16 
-0.13 
+0.16 
-0.13 
+0.06 
-0.05 
+0.05 
-0.04 
+0.05 
-0.05 
+0.04 
-0.05 
+0.20 
-0.17 
+0.05 
-0.05 
+0.05 
-0.04 
+0.21 
-0.17 
+0.03 
-0.04 
+0.19 
-0.17 
+0.20 
-0.18 



1 06+"-^^ 

^•^°-0.15 
^^•^'-0.13 
^•^'-0.15 

1.01+°i« 



1.01 
0.92 



0.14 
+0.09 
0.09 
+0.05 
0.04 
+0.05 



l-011o.05 

QS+O-OS 

iJ-y3_o.o4 

95+"-" 

'J-^'^-O.12 
J^-J^^-0.05 
^^•'^^-0.05 
^•'-'■'-0.13 
^^•^^-0.05 

^^•^^-0.12 

1 00+0-15 
l-'-"J-0.13 



00+0.07 
"■''^-0.06 

Q4+0-06 
^•='*-0.06 

1 02+0-0'' 
^•^^-0.06 

Q7+0.07 

-0.06 

1 00+0-03 
^•"^"^-0.02 

95+0-02 
^•^^-0.02 

1 03+0-02 
^•'-''^-0.02 
p, Qo+0.02 
"•9^-0.02 

,+0.07 
-0.06 
5+0.02 
-0.02 

1 ns+0-02 
i-'J'^-o.oi 

1 oe+o-o'^ 

^•^"-0.06 

95+0-02 
'J-^^-o.oi 

n QS+O-06 

'j-y^-0.06 

1 Qo+0.06 
^•'-'■'-0.06 



1.01] 

1.08^ 



6. Photometric redshifts, corrected for the actual (noisy) 
error distribution. 

The first two methods involve averaging over a redshift dis- 
tribution to determine E^^ without the use of photometric 
redshift information to eliminate physically associated pairs. 
The final four methods use photometric redshifts to select 
sources with Zp > zj -I- 0.1 to eliminate physically associ- 
ated source-source pairs. The two methods involving distri- 
butions use all r < 21 galaxies, including those for which 
photometric redshifts were not available and those passing 
the high-redshift LRG cuts; the last four methods only in- 
volve non-LRGs, since the photometric redshift error distri- 
butions for non-LRGs and LRGs are quite different. 

In this section, we compare the results using 30-2000 
/i~ikpc, without subdividing this region, since changes in 
the redshift distribution only affect the amplitude of the 
signal, and not its slope. Furthermore, we only use luminos- 
ity bins L3-L6 since the majority of the signal is in those 
bins; adding LI and L2 actually lowers the S/N slightly. 
Since changes in redshift distribution will tend to have a 
greater effect on the higher-redshift lenses, we are also in- 
terested in the results as a function of luminosity bin. Ta- 
ble llSl shows the results for the ratios Ri,a,0 for the selected 
ranges of radii, including 95 per cent confidence intervals 
computed via the non-Gaussian error computation for cor- 
related variables (correlations between the signal computed 
using method 1 or 2 and methods 3-6 were of order 0.75, 
and correlations among methods 3-6 were of order 0.98). Be- 
cause of the high correlations, the ratios were determined to 
high accuracy. Results are shown for L3, L6 (so that com- 
parison can illustrate trends in lens luminosity), and the 
average signal in bins L3-L6. 

We can see from table lT^ that at 95 per cent confidence 
level (CL), the results from these different ways of comput- 
ing EJ^ all give results that have overall calibration within 
15 per cent of each other. First, we compare the two redshift 



distributions shown in the table; this comparison is on line 
5 of the table. As shown, the distributions from COMBO-17 
and DEEP2 give results that have no difference in overall 
calibration, and this calibration can only differ by —2 to +3 
per cent at the 95 per cent CL when averaged over luminos- 
ity bins. Since Fig. |7| shows how similar these distributions 
are for r < 21, this result is not too surprising. 

Next, we compare the results for the different ways of 
using photometric redshifts. First, we can consider the differ- 
ence between using photometric redshifts directly and cor- 
recting for their mean bias (but not the width of the er- 
ror distribution), shown in line 7. As demonstrated by the 
results for L3-L6, correcting for the mean bias lowers the 
signal by about 3 per cent, and the results are statistically 
inconsistent with the results without any correction for pho- 
tometric redshift error at the 95 per cent CL. We expected 
the correction for the mean bias to lower the signal because 
(5) < for low photometric redshift, so correcting for the 
mean bias will raise the assumed source redshift and EJ^, 
lowering AE (while (S) > for high photometric redshift, so 
correcting for it should raise the signal, the effect at low pho- 
tometric redshift is apparently more important since E^^ 
varies more strongly with Zp at low Zp despite the weights 
proportional to E^^ that make higher redshift sources more 
important). We next consider what happens when we go 
from including the mean bias only, to including the full er- 
ror distribution (line 13). As shown, including the width of 
the error distribution then raises the signal by 5 per cent; 
the results corrected for only the mean bias versus using the 
full error distribution (bias and scatter) are statistically in- 
consistent at approximately the 4(t level, which means that 
including the width of the error distribution is very impor- 
tant. This increase in signal when we include the scatter is 
consistent with our expectation that scatter will lead to de- 
creased E~i, at least for the values Zp > z; -|- 0.1 used here 
(at lower Zp it may actually raise EJ^). The next compari- 
son is using photometric redshifts directly versus including 
the full error distribution, line 8; consistent with combining 
the aforementioned results from lines 7 and 13, we find that 
including the full error distribution increases the signal com- 
pared to no correction at all by approximately 2 per cent, 
though the results are marginally consistent at the 2a level. 
This near agreement is due to the effects from the mean bias 
and width of the distributions cancelling out; the nearly ex- 
act cancellation was impossible to predict in advance, which 
is why this systematics test is so useful. 

We can also use these results to place approximate 2a 
bounds on the calibration uncertainty due to statistical er- 
ror in the photometric redshift error distribution. We place 
these bounds using the comparison between no correction 
at all, correction for (S), and correction for the full error 
distribution, which gives approximately ±3 per cent calibra- 
tion uncertainty (95 per cent confidence level). This result 
is comparable to the 2a Poisson errorbars (determined via 
bootstrap resampling of the photometric redshift error dis- 
tribution) which are ±2.6 per cent for lens redshift ~ Zoff,L6, 
lower for lower lens redshift; we use the ±3 per cent figure 
to include extra uncertainty due to the effects of LSS. 

Another informative test is the comparison between the 
results using the best-fit error distribution versus the ac- 
tual error distribution. When plotted against each other, 
the best-fit distribution did show some significant differences 
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from the actual one; for example, it did not account for the 
tails in the distribution very well. This comparison is on line 
11 of table lTSl as shown, use of the best-fit error distribution 
rather than the actual one increases the signal by about 3 
per cent, and the two results are inconsistent at the 3cr level. 
The sign of the discrepancy is in accordance with the results 
for in Fig. El While the magnitude of this bias is clearly 
not large, since the difference is statistically significant, it 
seems that it would be preferable to use the actual error 
distribution to avoid this bias altogether. 

Finally, we compare the results using redshift distribu- 
tions versus using photometric redshift error distributions, 
lines 4 (DEEP2) and 15 (COMBO-17). We find that the 
signal is 3 per cent lower when we use either redshift dis- 
tribution than when we use the photometric redshift error 
distributions; however, the two results only differ by about 
la, so the difference in calibration is not statistically signif- 
icant. However, as mentioned previously and demonstrated 
m Fig. there is significantly more contamination from 
physically associated pairs when we use a distribution for 
r < 21 rather than using photometric redshift information 
to remove that contamination. Since there is no clear dis- 
crepancy due to the use of the redshift distributions versus 
photometric redshifts with error distributions, but the use 
of redshift distributions can lead to higher systematic error 
due to intrinsic alignments, we will henceforth use photo- 
metric redshifts with the actual error distributions for the 
r < 21 sample when comparing against the signal from other 
source subsamples. 

Another systematics comparison that was completed 
was the computation of signal with Zp > (zp) versus with 
Zp < (zp) without correcting for photometric redshift er- 
rors. If our understanding of photometric redshift errors is 
correct, then we expect the signal at high Zp to be lower than 
the signal at low Zp by a statistically significant amount. In- 
deed, the ratio Raii,a,p = 0.79; the 3a confidence interval is 
[0.63, 1.00], so the discrepancy between the two samples is 3a 
and has the correct sign. When we correct for photometric 
redshift error, the discrepancy still exists at the 2a level; it 
is 0.86, with 2a confidence interval [0.74, 1.00]. The fact that 
correcting for photometric redshift errors helps reduce the 
discrepancy indicates that we are understanding these errors 
properly. The fact that there is still a discrepancy at the 2a 
level is not too alarming, because as mentioned previously, 
the errors used here are purely statistical. Once we add the 
overall calibration uncertainty, the discrepancy between the 
two samples is only la and therefore not of concern. 

6.4-2 Faint (r ^ 21) sources 

For the r > 21 sample, two methods of redshift distribution 
determination were considered: 

1. The redshift distributions from COMBO-17, p{z\r), 
and 

2. The average redshift distribution p{z) from DEEP2 for 
21 < r < 21.8. 

In both cases, distributions were used, so we had no way of 
eliminating physically-associated lens-source pairs; however, 
since the r > 21 sample is, on average, at higher redshift 
than the lenses, this procedure is less important than for 
the r < 21 sample. In both cases, high-redshift LRGs are 



Table 16. A comparison of the overall calibration of the signal 
Ri a where a refers to the distribution from DEEP2 and /3 
to the distribution from COMBO-17, for the r ^ 21 sources, 
30 < r < 2000 h~^kpc, all luminosity bins with significant signal. 

Lens luminosity Ri^a.f} with 95 per cent CL bounds 



L3 
L4 
L5 
L6 

L3-L6 



1 02+°-°'' 
A+0.04, 
-0.05 

1 05+°°* 
^■'^'^-0.03 
-1+0.05 
-0.05 
-+0.03 



1.04"' 



1.09^ 



1.05^ 



0.02 



included in the sample. As in the previous subsection, we 
consider results from 30-2000 /i~^kpc. Results for Ri,a,i3 are 
shown in table ITHl since only one comparison is being made, 
we show results for all luminosity bins. 

Not surprisingly, the minor discrepancy (< 2a) that ex- 
ists for L3-L4 becomes worse for the higher redshift lenses, 
since they are closer to the peak of the distribution and are 
therefore more sensitive to details of the distribution. For L5, 
the discrepancy is 3a and for L6, Aa, with the result aver- 
aged over luminosities a 5 per cent discrepancy, and the two 
signals being different at the 4a level. Consequently, while 
the difference in signals is relatively small, we are able to use 
the systematics test to determine it to very high accuracy. 
The fact that the distribution from DEEP2 gives slightly 
higher results implies that it gives on average slightly lower 
{E~^) and therefore is at lower redshift on average. This re- 
sult is not entirely surprising, since the DEEP2 distribution 
is specific to our lensing catalog, but the COMBO-17 dis- 
tribution was for photometric galaxies in general, and our 
lensing cuts will tend to eliminate higher-redshift galaxies 
more than lower redshift ones. This understanding is in ac- 
cordance with Fig. [7| which shows both distributions for 
r > 21. Consequently, for the remainder of this paper, the 
DEEP2 redshift distribution will be used for the r > 21 
sample. 



6.4-3 LRG sources 

For LRGs, the signal was computed two ways: 

1. Using photometric redshifts directly, for 0.4 < Zp < 
0.65, and 

2. Using redshift distributions p{z\zp) determined via the 
inversion procedure from iPadmanabhan et al.. C2004ii"l for 
0.45 <zp< 0.65. 

Besides these two methods of comparing the redshift distri- 
bution, we did several tests to ensure that our cuts on the 
LRG sample were optimal. Using method |(i)| to determine 
redshifts, we recompute the signal twice, once with the re- 
quirement that dx > 0.5 and then with > 0.55; these 
stricter cuts should help eliminate more of the low-redshift 
contamination (this contamination is a concern because we 
would be vastly overestimating its redshift and E~^, and 
underestimating AE). If the signal computed using these 
cuts is similar to that computed with our cut, d±_ > 0.45, 
then we can be assured that the role of low-redshift contam- 
ination is minimal. In addition, we also compute the signal 
using photometric redshifts directly from 0.45 < Zp < 0.65 
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Table 17. A comparison of the overall calibration of the signal 
when computed using the methods considered in this section for 
the LRG sources, for 30 < r < 2000 h~^kpc, all luminosity bins 
with significant signal. The error bars are 95 per cent confidence 
limits. 



Table 18. A comparison of the overall calibration of the signal 
when computed using the three main source samples, all lumi- 
nosity bins with significant signal. The results for the two radial 
ranges are, of course, highly correlated. We show i?i,a./3 a-nd the 
95 per cent CL. 



Sources a 



Sources /3 



Zp 
Zp 
Zp 



> 0.5 

> 0.55 



0.45 < Zp 
p{z\zp) 



1 02+°-°'' 
-'-■'^^-0.05 
",+0.08 
-0.07 



1.00"* 



< 0.65 0.98 
1.00 



+0.10 
0.07 
+0.10 
0.08 



to ensure consistency with the full sample, in case there is 
contamination in the 0.4 < Zp < 0.45. Results for the ratio 
test are shown in table IT7I 

We can come to several conclusions from this table. 
Lines 1-3 show the results of the systematics tests: us- 
ing photometric redshifts directly for the full sample with 
d± > 0.45 and 0.4 < Zp < 0.65, versus requiring d± > 0.5 
(line 1), d±> 0.55 (line 2), or 0.45 < Zp < 0.65 (line 3). As 
shown, we see no sign of any systematic discrepancy. This 
result is useful because it suggests that our photometric red- 
shift and color cuts are sufBcient to eliminate low-redshift 
contamination to a negligible level, so there is no need to 
impose stricter cuts such as those listed above, which also 
lower the statistical power of this subsample. Line 4 shows 
a comparison of the results using photometric redshifts di- 
rect lyJor_0;^_<_^Zg_<_0;65 versus using the inversion method 
from IPadmanabhan et all J2004ft for 0.45 < Zp < 0.65. As 
shown, the results are completely statistically consistent. 
This result is not surprising, since the LRG sample is at 
much higher redshift than the lenses, and therefore small 
changes in the redshift distribution have little effect on the 
final results. Because of the larger numbers of LRGs in the 
0.4 < Zp < 0.65 sample, we will henceforth use the full LRG 
sample with photometric redshifts directly when comparing 
against other subsamples. When considering the variation 
in calibration between the samples in table [T71 we conclude 
that the calibration uncertainty in the LRG signal due to 
redshift distribution determination is ±10 per cent at the 
2a level. 



6.4.4 All 

In this subsection we compare the results from the three 
source samples (r < 21, r > 21, and LRGs) to check that 
they are consistent. As mentioned in the previous sections, 
we will use the photometric redshifts with the full error dis- 
tributions from DEEP2 for r < 21 (without LRGs), the 
redshift distribution from DEEP2 for the r > 21 sample, 
and photometric redshifts directly for the LRG sample. 

Furthermore, in contrast to the previous sections, our 
concern here is not merely overall calibration; since the dif- 
ferent source samples have different boost factors and dif- 
ferent random catalog signals, it is important to check for 
radius-dependent effects. In principle, we could also measure 
the ratios Ri,a,p for several radial ranges. However, the S/N 
for these comparisons was not high enough to give reason- 
able errorbars. Consequently, we only consider larger ranges 
of 150-2000 /i"^kpc and 30-2000 /i"^kpc. The reason for 



Lenses 30-2000 h-^kpc 150-2000 h'^kpc 
r < 21 without LRGs versus r > 21 



L3 


1 ir, + 0.48 

^- J^^-0.40 


1 oq+'^" 

J^-Uy_o.32 


L4 


^■'■'^-0.22 


'-■"1-0.24 


L5 


-0.16 




L6 




0.85t»:- 


L3-L6 


1 03+°-" 





< 21 without LRGs versus LRGs 



L3 


1 1-1+0.93 

-0.38 


^^■'-'•^-0.38 


L4 


l.Ur_Q 27 


J^-^'J-0.33 


L5 


^■'J^-0.18 


1 (,1+0.30 


L6 


qs+0-27 


u.au_o 23 


L3-L6 


^■'J^-0.20 


1 nn+0.19 
^^•'-"^-0.16 



r > 21 versus LRGs 



L3 


n 00+0.75 


n Q4+0'79 
'^■^^-0.34 


L4 


u.yy_Q 24 


1 02+0.46 


L5 


"■^"-0.15 


0.98lE!;2| 


L6 


1.03«-JI 


i.UD_(j 23 


L3-L6 


0.98«:}? 


J^-*JJ^-0.14 



using the 150-2000 /i~^kpc range is that even for L6 it ex- 
cludes the angular separations < 90" for which the boost 
factor has the difficult to quantify systematic due to sky sub- 
traction, and it excludes the radii for which magnification 
bias may be important. Consequently, 150-2000 /i~^kpc is a 
more systematic-free ranges. Results are shown in table ITHl 
We can draw several conclusions from this table. First, 
the fact that the ratios using 30-2000 and 150-2000 h'^kpc 
are so similar illustrates that even with our concern about 
the systematic in the boosts at low values of transverse sep- 
aration, use of the full radial range does not cause obvi- 
ous problems. Second, there seems to be some luminosity- 
dependence of the overall calibration difference between the 
samples (especially r < 21 versus r > 21), but the error- 
bars on individual luminosity bins are large enough that no 
definite statement about this can be made. Finally, when av- 
eraged over all luminosities, the results for the three source 
subsamples agree well within errorbars. We see no sugges- 
tion that there is anything fundamentally wrong with any of 
these three samples, either in the shear calibration, redshift 
distributions, or other systematics, and consider the con- 
sistency of the samples to be established at the 10% level 
(la). 

Here we show plots of the signal. Fig. 1161 shows the sig- 
nal from L5 for the three samples to illustrate our finding 
from the systematics test that there is no systematic cali- 
bration offset between the three samples. Fig. 1171 shows the 
signal averaged over source samples for all 6 luminosity bins, 
with statistical errorbars only, and the results of halo m odel 
fits to the signal a s described in ISeliak et al.l and 
iMandelbaum et al.l (l2004l) . In order to allow the AE values 
shown on the plots to be converted to shear 7t, table [T^ in- 
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AS averaged over source sample 




100 1000 100 1000 

r [kpc/h] r [kpc/h] 



Figure 17. AS(r) averaged over source subsample as a function of lens luminosity. For LI and L2, which have signal statistically 
consistent with zero, the vertical scale is not logarithmic, and the zero level is shown as a dashed line; for the other luminosity bins, a 
logarithmic scale is used for AS. The errorbars shown are statistical only; the lines are the results of halo model fits as described in the 
text. 



Table 19. For reference, (Sc ^) values for all luminosity bins 
and source samples. 



Lenses 10^ (S^ ^ ), pc^/ZiM© 

r < 21 r > 21 LRG 



LI 


5.2 


5.3 


5.5 


L2 


7.2 


7.4 


7.8 


L3 


10.2 


10.5 


11.4 


L4 


13.3 


13.5 


15.4 


L5 


16.1 


15.9 


19.0 


L6 


18.1 


17.1 


22.0 



eludes (Ec ^) for each lens and source sample combination. 



7 CONCLUSIONS 

Using the systematics test, we have estimated the effects 
of redshift distribution errors and other systematics on the 
lensing signal. As a summary, table 1201 lists those system- 
atics that are significant at approximately the 1 per cent 
level or higher. Systematics are only estimated in this ta- 
ble for the redshift determination methods for each sample 
that we have selected to use in this and future works (e.g., 
we do not show results for the use of redshift distributions 
for the r < 21 sample). We must emphasize that the esti- 
mates of the calibration uncertainty due to these effects is 
generally only relevant for this work, and their effects for 
other works must be assessed using the lens/source catalogs 
used for those works. For those systematics that have some 
nontrivial dependence on lens sample and/or radial range, 
rather than listing a value in this table, we refer the reader 
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AS for L5, each source sample 




1 - 



100 1000 
r [kpc/h] 

Figure 16. AS(r) in L5 for each source sample as labelled on 
the plot. Points have a slight horizontal offset so that errorbars 
on all three signals are visible. 

to the appropriate table/section. We have not included the 
following systematic effects because they are expected to 
be at the 0.1 per cent level: atmospheric refraction, camera 
shear, the density-systematic shear correlation {5n ■ 'ysys), 
and non-uniformity of the boost factor. 

The first section of table 1^ includes a summary of the 
findings regarding the redshift distribution calibration, and 
ends with a calculation of the overall calibration uncertainty 
for each sample due to redshift distribution-related factors. 
The second section includes a list of other factors that lead 
to calibration uncertainty that have been discussed in this 
paper. The table ends with an estimate of the overall calibra- 
tion uncertainty at the 2cr level for various lens, source com- 
binations and ranges of radii. The estimates are somewhat 
rough, in the sense that they cannot include all details of 
the error estimates completed in the text, and consequently 
a better idea of the magnitudes of some of the errors can be 
found by reading the appropriate sections of the text. 

In this table, we refer to "small" versus "large" radii, 
where the distinction between the two is that on "large" radii 
the estimates only include the calibration uncertainties such 
as stellar contamination, shear calibration, and redshift dis- 
tributions, but for "small" radii, we must also be concerned 
about systematics such as intrinsic alignments, sky subtrac- 
tion effects, and magnification bias. The definition of small 
versus large radii varies by luminosity bin, and the approx- 
imate transition between the two occurs at 40, 60, 80, 100, 
140, and 200 /i~^kpc for L1-L6 respectively. 

The overall uncertainty estimates in table l20l were com- 
puted using one of several possible methods. The difficulty 
is that by definition systematic errors are those with un- 
known probability distributions, so it is not clear how to 
combine confidence intervals for multiple eff'ects. The first 
possible method, the most conservative, would be to lin- 
early add together the upper bounds for all the independent 
sources of error to get an overall upper bound, and sim- 
ilarly for the lower bound. This method does not require 



us to make any assumptions about the probability distri- 
butions for these systematic errors, most of which are very 
likely not remotely Gaussian. The second approach would 
be to assume a uniform distribution of the systematic error 
between the 2(t bounds [a, h] given in the table. With this 
assumption, we can find the mean (a + b)/2 and the stan- 
dard deviation (b—a) / \fVl for each systematic error, average 
the means and add the standard deviations in quadrature, 
and use those results to find combined 2a bounds. This ap- 
proach is also justifiable, since the use of a uniform distri- 
bution leads to rather generous confidence intervals, but the 
results are still smaller than the very conservative bounds 
from the first approach. The final approach would be to add 
the lower bounds in quadrature, and add the upper bounds 
in quadrature. For table l20l we used the bounds from the sec- 
ond method (assuming a uniform distribution); the results 
using the first method are about 50-100 per cent larger, and 
using the third method are about 0-10 per cent larger. 

In this paper, we have sought to understand the vari- 
ous sources of calibration uncertainty and other systemat- 
ics in the galaxy-galaxy weak lensing signal computed us- 
ing our lens and source catalogs. As more data is collected, 
we will be able to perform these tests to constrain the sys- 
tematics with greater precision. The understanding we have 
gained from the systematics test will allow us to use this 
catalog for many scientific applications (studies of the rela- 
tion between mass and luminosity, mass and stellar masses, 
halo profiles as a function of environment, and others) with 
an understanding of the implication of systematic errors on 
the results. Similar investigations of systematics will also be 
needed for weak lensing auto-correlation analyses, once the 
statistical errors become as small as they are for our SDSS 
galaxy-galaxy lensing analysis. 
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Table 20. Summary of the main systcmatics investigated (and found to be significant at the 1 per cent level or greater) in this work, 
and estimates of their significance. Bounds on the error are given at the 2cr level for each effect and for all systematics combined. 



Systematic 


Lenses 


Sources 


Radii 


2(T bounds 


Redshift distribution calibration 


Error in p(5\zp) 


All 


r < 21 


All 


±0.03 


Redshift failures from DEEP2 (table EJ 


All 


r > 21 


All 


[-0.14, +0.025] 


Statistical error in p{z) 


All 


r > 21 


All 


Table El 


LRG redshift distribution failure 


All 


LRGs 


All 


±0.10 


Overall redshift calibration 


All 


r < 21 


All 


[-0.03,0.03] 


Overall redshift calibration 


All 


r > 21 


All 


[-0.14,0.07] 


Overall redshift calibration 


All 


LRGs 


All 


±0.10 


Other systematic effects 


Shear calibration bias ('i|2.2.2i 


All 


r < 21 


All 


[-0.05,0.12] 


Shear calibration bias 


All 


r > 21 


All 


[-0.08,0.18] 


Shear calibration bias 


All 


LRGs 


All 


[—0.06, 0.19] 


Stellar contamination fil6.3.3i 


All 


r < 21 


All 


[-0.040,0.0] 


Stellar contamination 


All 


r > 21 


All 


[-0.096, -0.003] 


Stellar contamination 


All 


LRGs 


All 


[-0.068,0] 


Intrinsic alignments ('>lfci.3.8i 


L3-L6 


r < 21 


All 


Table [HI 


Intrinsic alignments 


L3-L6 


r > 21 


All 


Table El 


Intrinsic alignments 


L3-L6 


LRGs 


Small 


Table [HI 


Magnification bias fil4.7.3l 


L3-L6 


r < 21 


Small 


Table El 


Magnification bias 


L3-L6 


r > 21 


Small 


Table El 


Magnification bias 


L3-L6 


LRGs 


Small 


Table El 


Photo sky level error ("i|6.3.7l 


All 


All 


Small 


[-0.15,0] 


Overall uncertainty 


L1-L4 


r < 21 


Small 


[-0.15,0.16] 


Overall uncertainty 


L1-L4 


r < 21 


Large 


[-0.09,0.11] 


Overall uncertainty 


L5-L6 


r < 21 


Small 


[-0.41,0.42] 


Overall uncertainty 


L5-L6 


r < 21 


Large 


[-0.13,0.14] 


Overall uncertainty 


L1-L4 


r > 21 


Small 


[-0.22,0.21] 


Overall uncertainty 


L1-L4 


r > 21 


Large 


[-0.19,0.19] 


Overall uncertainty 


L5-L6 


r > 21 


Small 


[-0.44, 0.44] 


Overall uncertainty 


L5-L6 


r > 21 


Large 


[-0.27,0.26] 


Overall uncertainty 


L1-L4 


LRGs 


Small 


[-0.18,0.19] 


Overall uncertainty 


L1-L4 


LRGs 


Large 


[-0.16,0.18] 


Overall uncertainty 


L5-L6 


LRGs 


Small 


[-0.21,0.23] 


Overall uncertainty 


L5-L6 


LRGs 


Large 


[-0.16,0.18] 
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APPENDIX A: FOR REDSHIFT 
DISTRIBUTION FITS 

The redshift distribution fits were obtained by minimizing 
the "x^" function 



X 



-2^1np(z,), 



(Al) 



where A^s is the number of spectra, andp(2i) = (dP/dz)!^^^. 
is the normalized redshift distribution. Equation lAH is a 
"X^" in the sense that the likelihood for a given model p{z) 
is proportional to e"-^ ^'^ in the case where the galaxy red- 
shifts are independent. While minimizing x^ is a reasonable 
way to get the redshift distribution, it is not trivial to test 
for a "goodness of fit" since (i) even in the case of indepen- 
dent redshifts. Equation lAH does not follow the standard 
X'^ distribution, and (ii) there is large scale structure (LSS) 
that introduces correlations between different redshifts, par- 
ticularly in a narrow survey such as DEEP2. This Appendix 
is devoted to calculating the statistical properties of x^ and 
the fit parameters obtained via its minimization. 

Al Mean and variance of x^ 

The mean of Eq. is 



= ix') = 2NsS, 



(A2) 



where the "entropy" S is the expectation value of — lnp(2) 
for a single galaxy, i.e. 



S - 



p{z) lnp{z) dz. 



(A3) 



The entropy is a property of the redshift distribution only; 
e.g. for the F-distribution (Ea. l40ll . 

S = a + \nV{a)~{a-l)il){a)+\nZs, (A4) 

where ^{a) = d[lnr(Q)]/da is the digamma function. Note 
that S can be positive, zero, or negative depending on the 
distribution; it is larger for wide distributions. 

Equation 1A2I I is the mean of a simple sum over galaxies 
and hence it is valid regardless of LSS. The variance is more 
complicated; it is 

al. = ((x' - (x^))^) = 4 {[Inp(20 + 5][Inp(2,) + S\). 

(A5) 

The expectation value here contains two galaxy redshifts, 
and hence depends on both the redshift distribution and 
the redshift correlation function. Defining the redshift cor- 
relation function by 

p{zi,Zj)\-_^^ ^p{zi)p{zj)[l + ^4zi,Zj)], (A6) 

we find 



where 



and 



= msSi^i+4Ns{Ns - 1)S2, 



Si,i = / p{z)[lnp(z) + S]^dz 



(A7) 
(A8) 



S2 = J p{zi)p{z2)^z{zi,Z2)[lnp{zi) + S] 

x[lnp{z2) + S]dzidz2. (A9) 



The "independent redshift" contribution to 0-^2 comes from 
the 51,1 term, which does not involve the correlation func- 
tion. For the F-distribution, 

Si,i = {a-lfiP'{a)-a + 2. (AlO) 



A2 Correlation function and S2 

The evaluation of the clustering contribution ^2 to the vari- 
ance of x^ is more complicated. We begin by assuming that 
the three-dimensional correlation function of the sources is 
a power law, ^(r) = (ro/r)^. Written in polar coordinates, 
this becomes 



ari,r2,0) 



[{n-r2)^ + rf9^p/2'' 



(All) 



where the approximation is valid for small 9 (the relevant 
case here). The correlation falls off at large |r-i — r2| as |ri — 
r2|~^, hence for 7 > 1 the integral is finite and ^ is non-zero 
only for ri ~ rg. We thus make a Limber approximation. 



^{ri,r2,9) ~ r(]5(ri - 7-2) 



dAr 



(Ar2 +r26l2)7/2 



= B (^^, i j r^oiriO)'-'S{ri ~ r2). (A12) 

The resulting correlation function ^2(21,22) is obtained by 
converting from comoving distance to redshift, and averag- 
ing over the angular separations 9 in the survey, 



^z{zi,Z2) = A{{ril 



\l-7 



dz 
dr 



5{zi — 22), 



(A13) 



where for the purposes of obtaining a rough estimate of the 
importance of S2 we assum( 
slant.* The result for S2 is 



importance of S2 we assume A = B{^^-^, |)ro to be a con 



S2 = Aie'--") J [p{z)Yr'-''H{z)[lnp{z) + SYdz, (A14) 

where we have used dz/dr = H{z). We may find A and 7 
by comparing to the observed angular 2-point correlation 
function of the galaxies; the theoretical prediction is 

Lu{9) = I ^(ri,r2,0)^(ri)^(r2)dridr2 



A9 



1-7 



[p{z)Yr'~^H{z)dz. 



(A15) 



The angular correlation function in SDSS dConnoUv et alJ 
2002) is fit by a power law with 7=1.7. The DEEP2/SDSS 
overlap region can be approximated by a rectangle of dimen- 
sions 0.25 X 0.6 deg, hence averaged over all pairs of points 
in this region (obtained via the obvious Monte Carlo proce- 
dure) is (6|i-^) ^ 3.7deg-° 



* Technically Eq. lA13t should contain an additive constant 
to account for the integral constraint J p(zi, Zj)dzidzj = 1 in 
Eq. 1A61 . An additive constant in f z would not affect 52 because 
its contribution in Eq. IA9I multiplies J p(zi)p{z2)[lnp{zi) + 
S'][lnp(22) -|-S']d2id22 = 0. 
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A3 Parameter uncertainties 

The uncertainty of the redshift distribution parameters is 
greater than the naive result from Ax^ contours because 
of source clustering. We begin our analysis of this effect by 
assuming that the redshift distribution depends on a vec- 
tor of M parameters a — {a^...a^'')\ in the case of the F- 
distribution, M = 2 and a — {zs,a). We are then estimating 
the parameters a by minimizing x^(a). We denote the true 
set of parameters by ao. 

We begin by Taylor expanding around ao, 



X^(a) = x^(ao)+G-(a-ao) + (a-ao)-K(a-ao) 



(A16) 



where G and K are the derivatives of x^ at ao. The minimum 
value of a in this approximation is then 



ao 



(A17) 



We can calculate G and K in terms of the measured source 
redshifts, 



-2E 



and 



lnp(2:,|a' 



■ lnp{z 



(A18) 



(A19) 



We now estimate the statistical properties of a using 
Eq. <A17L approximating K by its expectation value for 
simplicity: 



= iV, 



lnp{zi 
dlnp{zi) dlnp{zi) 



9af 



(A20) 



where we have suppressed the argument a in p{zi) for sim- 
plicity of notation.^ 

The expectation value of G,! is easily verified to be 
(Gil) — 0, and so we have (a) — oq. The covariance of 
G is 



(G^G.) = 4 ^ 

»>3 = 1 

= 4^-, 



d\np{zi) d\np{zj 



da 



d\np{zi) d\np{zj) 



(A21) 



(The i = j terms become Jf^i/, following the argument of 
Ea. lA20l 'l The second term can be evaluated using Eq. 1A6II 
and Eq. lIXTsll to yield 



(G^G.) = 'iK^^+4Ns{Ns-l)A{e^-'') 

X lr'-^H{z)^^dz. 
J ^ ' daf^ aa" 



(A22) 



^ The second equality in Eq. iA20i can be found by 
differentiating the relation J p(z)dz = 1 twice to get 
{[d'^ p{z) / dai^ da'^] / p{z)) = 0, and then re-writing this result in 
terms of Inp(z). 



The covariance matrix of a is then 
1 
4 



dp dp 



X 



da^ daP 



dz, (A23) 



which is larger than the covariance matrix K^^ that would 
be obtained via the usual Ax^ = 1 prescription. 

In the case of the F-distribution with parameters {za, a), 
we find 



(A24) 



APPENDIX B: NON-GAUSSIAN CONFIDENCE 
INTERVALS FOR CORRELATED VARIABLES 

In this section, we explain the method used to compute non- 
Gaussian confidence intervals on the ratio of correlated vari- 
ables. First, we consider the case of the ratio of two uncor- 
related variables with Gaussian noise, then generalize to the 
correlated case. 

When computing the error on the ratio of two vari- 
ables R = y/x, where y and x are Gaussian variables 
with standard deviations ay and a^, the naive result an = 

R\j [fyOylyf' + ((Ta,/a;)^) is unreliable in the case when x 
may be consistent with zero even at the many-cr level. In- 
stead, the non-Gaussian confidence intervals should be used. 
If we want to find the confidence intervals on R at the za 
level (i.e. z = \ woul d give the 68 per cent confidence inter - 
val), then the result llBhsjll935al: lBhsjll935ht iFiellejIlOsJ) 
is as follows: 



± limit on ii = 



^A2 



where we define 



+ 



(Bl) 



(B2) 



If y and x are correlated, with nonzero correlation co- 
efficient p = Cov(2;, j/)/((Ji,aj,), then Eq. IBH is no longer 
valid. In order to get confidence intervals on R, we do the 
following manipulations: we change variables so that we are 
once again dealing with two uncorrelated variables, in this 
case w = y — mx, where m = pay /ax. Here w is clearly 
uncorrelated with x, and has error a^ = a^il — p^). Fur- 
thermore, R = w /x + m. We can then use the procedure for 
uncorrelated variables to compute confidence intervals on 
w/x, so that the upper and lower limits on y/x are related 
to those on w/x by the addition of m. 



APPENDIX C: CALIBRATION BIAS FROM 
SIGNIFICANCE CUT 

There is a contribution to the shear calibration bias origi- 
nating from selection effects from the signal-to-noise cut in 
our catalog. We can investigate the magnitude of this ef- 
fect by the same methods used in H2.2.2I Photo convolves 
the observed image / with the PSF P to create a new image 
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J = I(^P, and finds pixels in P above some threshold Jc, set 
to vw[J{x)] above the sky background, where the signal-to- 
noise threshold is vt = 5. This convention is chosen because, 
in the case where P is symmetric under 180 degree rotation, 
J{x) represents the optimal statistic for searching for a point 
source at x. Objects above Jc are designated as BINNED 1 
by Photo; for detection, we require that this flag be set in 
the r and i bands. The threshold is 



0A7vtn^''^e-]^^j^ (Gaussian) , , 

Q.^7vtn^^^e-\^jjj,,j (Kolmogorov) ' ^ ' 

where n is the noise variance per pixel and Ofwhm is 
the full-width at half maximum of the PSF. We have 
shown both the case of the Gaussian PSF and the Kol- 
mogorov turbulence-induced PSF InP(fc) oc k^^^. For typ- 
ical parameters, Jc — 0.13 nmgy arcsec"^ in r band and 
0.22 nmgy arcsec"^ in i band. 

The value of J(0), where we have translated the object's 
centroid to the origin 0, is given by 

J(0) = J /(a;)P(a;)d^x = J f{x)K{x)d^jc, (C2) 

where K = P ® P and we have assumed P{x) — P{~x). 
The simplest way to evaluate J is in the case of a circular 
Gaussian PSF and elliptical Gaussian galaxy, for which 

J(0) = ^ [det (m<^'' + 2M(-^')] , (C3) 

where F is the galaxy flux. Under a shear leaving cr'--'^' con- 
stant, this varies by 

5J{0) _ 2(l-7?2)e-7 



J(0) 4R-^ -i + R2-e^ 



(C4) 



The resulting shear selection bias can then be computed by 
the same method as was used in H2.2.2I It yields 



S-y _ / (1 - i?2)e^ \ dN{j) 



7 \(47^^'-4-|-i?2-e2)7^/ dlnj 



(C5) 



where N{j) is the (weighted) fraction of galaxies with 
J(0) < j. The coefficient of dN{j) /din j has a mean value 
(averaged over the R2 distribution) of —0.024, —0.028, and 
—0.026 for the r < 21, r > 21, and LRG samples respec- 
tively in r band, and -0.023, -0.026, and -0.024 in i band. 
Conservative values of dA'^(j)/dlnj (obtained by maximiz- 
ing over j, since we have not calculated the threshold ex- 
actly) are 1.40, 2.37, and 1.42 respectively in r band, and 
1.23, 2.01, and 1.46 in i band. We conservatively estimate 
the calibration bias due to the signal to noise cut using the 
r band values, and obtain —0.034, —0.066, and —0.037 for 
r < 21, r > 21, and LRG samples respectively. 



